skip to main content
10.1145/3210284.3210297acmconferencesArticle/Chapter ViewAbstractPublication PagesdebsConference Proceedingsconference-collections
research-article

FogStore: A Geo-Distributed Key-Value Store Guaranteeing Low Latency for Strongly Consistent Access

Published: 25 June 2018 Publication History

Abstract

We design Fogstore, a key-value store for event-based systems, that exploits the concept of relevance to guarantee low-latency access to relevant data with strong consistency guarantees, while providing tolerance from geographically correlated failures. Distributed event-based processing pipelines are envisioned to utilize the resources of densely geo-distributed infrastructures for low-latency responses - enabling real-time applications. Increasing complexity of such applications results in higher dependence on state, which has driven the incorporation of state-management as a core functionality of contemporary stream processing engines a la Apache Flink and Samza. Processing components executing under the same context (like location) often produce information that may be relevant to others, thereby necessitating shared state and an out-of-band globally-accessible data-store. Efficient access to application state is critical for overall performance, thus centralized data-stores are not a viable option due to the high-latency of network traversals. On the other hand, a highly geo-distributed datastore with low-latency implemented with current key-value stores would necessitate degrading client expectation of consistency as per the PACELC theorem. In this paper we exploit the notion of contextual relevance of events (data) in situation-awareness applications - and offer differential consistency guarantees for clients based on their context. We highlight important systems concerns that may arise with a highly geo-distributed system and show how Fogstore's design tackles them. We present, in detail, a prototype implementation of Fogstore's mechanisms on Apache Cassandra and a performance evaluation. Our evaluations show that Fogstore is able to achieve the throughput of eventually consistent configurations while serving data with strong consistency to the contextually relevant clients.

References

[1]
DataStax configuring data consistency in apache cassandra. https://docs.datastax.com/en/cassandra/2.1/cassandra/dml/dml_config_consistency_c.html. Accessed: 2018-02-19.
[2]
Kafka connect. https://docs.confluent.io/current/connect/intro.html. Accessed: 2018-03-07.
[3]
Lorenzo Affetti. Consistent stream processing: Doctoral symposium. In Proceedings of the 11th ACM International Conference on Distributed and Event-based Systems, DEBS '17, pages 355--358, New York, NY, USA, 2017. ACM.
[4]
Austin Appleby. Murmurhash 2.0, 2008.
[5]
Flavio Bonomi, Rodolfo Milito, Jiang Zhu, and Sateesh Addepalli. Fog computing and its role in the internet of things. In Proceedings of the first edition of the MCC workshop on Mobile cloud computing, pages 13--16. ACM, 2012.
[6]
Mohamed Ben Brahim, Wassim Drira, Fethi Filali, and Noureddine Hamdi. Spatial data extension for cassandra nosql database. Journal of Big Data, 3(1):11, 2016.
[7]
Paris Carbone, Asterios Katsifodimos, Stephan Ewen, Volker Markl, Seif Haridi, and Kostas Tzoumas. Apache flink: Stream and batch processing in a single engine. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 36(4), 2015.
[8]
Valeria Cardellini, Vincenzo Grassi, Francesco Lo Presti, and Matteo Nardelli. On qos-aware scheduling of data stream applications over fog computing infrastructures. In Computers and Communication (ISCC), 2015 IEEE Symposium on, pages 271--276. IEEE, 2015.
[9]
Bastien Confais, Adrien Lebre, and Benoît Parrein. Performance analysis of object store systems in a fog and edge computing infrastructure. In Transactions on Large-Scale Data-and Knowledge-Centered Systems XXXIII, pages 40--79. Springer, 2017.
[10]
Rodrigo S Couto, Stefano Secci, Miguel Elias M Campista, and Luís Henrique MK Costa. Latency versus survivability in geo-distributed data center design. In Global Communications Conference (GLOBECOM), 2014 IEEE, pages 1102--1107. IEEE, 2014.
[11]
Kirak Hong, David Lillethun, Umakishore Ramachandran, Beate Ottenwälder, and Boris Koldehofe. Mobile fog: A programming model for large-scale applications on the internet of things. In Proceedings of the second ACM SIGCOMM workshop on Mobile cloud computing, pages 15--20. ACM, 2013.
[12]
Ovidiu-Cristian Marcu, Radu Tudoran, Bogdan Nicolae, Alexandru Costan, Gabriel Antoniu, and María S Pérez-Hernández. Exploring shared state in key-value store for window-based multi-pattern streaming analytics. In Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pages 1044--1052. IEEE Press, 2017.
[13]
Marlon McKenzie, Hua Fan, and Wojciech Golab. Fine-tuning the consistency-latency trade-off in quorum-replicated distributed storage systems. In Big Data (Big Data), 2015 IEEE International Conference on, pages 1708--1717. IEEE, 2015.
[14]
Shadi A Noghabi, Kartik Paramasivam, Yi Pan, Navina Ramesh, Jon Bringhurst, Indranil Gupta, and Roy H Campbell. Samza: stateful scalable stream processing at linkedin. Proceedings of the VLDB Endowment, 10(12):1634--1645, 2017.
[15]
Muntasir Raihan Rahman, Lewis Tseng, Son Nguyen, Indranil Gupta, and Nitin Vaidya. Characterizing and adapting the consistency-latency tradeoff in distributed key-value stores. ACM Transactions on Autonomous and Adaptive Systems (TAAS), 11(4):20, 2017.
[16]
Hans Sagan. HilbertâĂŹs space-filling curve. In Space-filling curves, pages 9--30. Springer, 1994.
[17]
Enrique Saurez, Kirak Hong, Dave Lillethun, Umakishore Ramachandran, and Beate Ottenwälder. Incremental deployment and migration of geo-distributed situation awareness applications in the fog. In Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems, pages 258--269. ACM, 2016.
[18]
Jeff Shute, Radek Vingralek, Bart Samwel, Ben Handy, Chad Whipkey, Eric Rollins, Mircea Oancea, Kyle Littlefield, David Menestrina, Stephan Ellner, et al. F1: A distributed sql database that scales. Proceedings of the VLDB Endowment, 6(11):1068--1079, 2013.
[19]
Subhajit Sidhanta, Wojciech Golab, Supratik Mukhopadhyay, and Saikat Basu. Adaptable sla-aware consistency tuning for quorum-replicated datastores. IEEE Transactions on Big Data, 3(3):248--261, 2017.
[20]
Yuuichi Teranishi, Ryohei Banno, and Toyokazu Akiyama. Scalable and locality-aware distributed topic-based pub/sub messaging for iot. In Global Communications Conference (GLOBECOM), 2015 IEEE, pages 1--7. IEEE, 2015.
[21]
Philip Wette, Martin Draxler, Arne Schwabe, Felix Wallaschek, Mohammad Hassan Zahraee, and Holger Karl. Maxinet: Distributed emulation of software-defined networks. In Networking Conference, 2014 IFIP, pages 1--9. IEEE, 2014.
[22]
Victor Zakhary, Faisal Nawab, Divyakant Agrawal, and Amr El Abbadi. Global-scale placement of transactional data stores. In Proceedings of the 21th International Conference on Extending Database Technology, EDBT, pages 26--29, 2018.

Cited By

View all
  • (2024)Databases in Edge and Fog Environments: A SurveyACM Computing Surveys10.1145/366600156:11(1-40)Online publication date: 8-Jul-2024
  • (2024)PathFSProceedings of the 7th International Workshop on Edge Systems, Analytics and Networking10.1145/3642968.3654822(55-60)Online publication date: 22-Apr-2024
  • (2024)Large-Scale Causal Data Replication for Stateful Edge Applications2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS60910.2024.00028(209-220)Online publication date: 23-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DEBS '18: Proceedings of the 12th ACM International Conference on Distributed and Event-based Systems
June 2018
289 pages
ISBN:9781450357821
DOI:10.1145/3210284
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Distributed key-value stores
  2. context awareness
  3. edge computing
  4. latency-consistency trade-off

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

DEBS '18

Acceptance Rates

DEBS '18 Paper Acceptance Rate 12 of 31 submissions, 39%;
Overall Acceptance Rate 145 of 583 submissions, 25%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)58
  • Downloads (Last 6 weeks)5
Reflects downloads up to 30 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Databases in Edge and Fog Environments: A SurveyACM Computing Surveys10.1145/366600156:11(1-40)Online publication date: 8-Jul-2024
  • (2024)PathFSProceedings of the 7th International Workshop on Edge Systems, Analytics and Networking10.1145/3642968.3654822(55-60)Online publication date: 22-Apr-2024
  • (2024)Large-Scale Causal Data Replication for Stateful Edge Applications2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS)10.1109/ICDCS60910.2024.00028(209-220)Online publication date: 23-Jul-2024
  • (2024)Coordinating Compaction Between LSM-Tree Based Key-Value Stores for Edge Federation2024 IEEE 17th International Conference on Cloud Computing (CLOUD)10.1109/CLOUD62652.2024.00054(419-429)Online publication date: 7-Jul-2024
  • (2024)Storage Technology Trends and DevelopmentData Storage Architectures and Technologies10.1007/978-981-97-3534-1_14(379-428)Online publication date: 28-Aug-2024
  • (2023)Data Management for mobile applications dependent on geo-located dataProceedings of the 10th Workshop on Principles and Practice of Consistency for Distributed Data10.1145/3578358.3591334(70-76)Online publication date: 8-May-2023
  • (2023)Eventually Consistent Configuration Management in Fog Systems With CRDTs2023 IEEE International Conference on Cloud Engineering (IC2E)10.1109/IC2E59103.2023.00032(220-221)Online publication date: 25-Sep-2023
  • (2023)The SPEC-RG Reference Architecture for The Compute Continuum2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid)10.1109/CCGrid57682.2023.00051(469-484)Online publication date: May-2023
  • (2023)AS-cast: Lock Down the Traffic of Decentralized Content Indexing at the EdgeAlgorithms and Architectures for Parallel Processing10.1007/978-3-031-22677-9_23(433-454)Online publication date: 11-Jan-2023
  • (2023)Managing data replication and distribution in the fog with FReDSoftware: Practice and Experience10.1002/spe.323753:10(1958-1981)Online publication date: 11-Jul-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media