FluteDB: An Efficient and Dependable Time-Series Database Storage Engine

Li, Chen; Li, Jianxin; Si, Jinghui; Zhang, Yangyang

doi:10.1007/978-3-319-72395-2_41

Chen Li¹⁷,
Jianxin Li¹⁷,
Jinghui Si¹⁷ &
…
Yangyang Zhang¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10658))

Included in the following conference series:

International Conference on Security, Privacy and Anonymity in Computation, Communication and Storage

3150 Accesses
3 Citations

Abstract

Recently, with the widespread use of large-scale sensor network, time-series data is vastly generated and requires to be processed. Those traditional databases, however, show their limitations in storage when handling such a large stream data. Besides, the actual dependability of databases are also difficult to be guaranteed. In this paper, we present FluteDB, an efficient and dependable time-series database storage engine, which is composed of multiple time-series enhanced sub-modules. The validations of all sub-modules have demonstrated that our improved strategies significantly outperform the existing methods in real time-series environment. Meanwhile, the complete FluteDB utilizes various measures to guarantee its dependability and achieves a higher overall storage efficiency than the state-of-the-art time-series databases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Write rate is a quantitative measurement of the storage ability.
2.
Though the interval of data sampling is not specified, it will be limited in a fixed range according its own streaming and large-scale characteristics.
3.
The size of slice is determined by the average size of delta of the deltas.
4.
Our method, which is a partial sequential operation strategy, is different from LSM Tree [8].
5.
An ability to respond the possible node or Information Data Center (IDC) failure.

References

TimescaleDB: SQL made scalable for time-series data. http://www.timescale.com/papers/timescaledb.pdf
Storage Engine of InfluxData. https://docs.influxdata.com/influxdb/v1.2/concepts/storage_engine/
The world’s most popular open source database. https://www.mysql.com/
The world’s most advanced open source database. https://www.postgresql.org/
An open source in-memory data structure store. https://redis.io/
Pelkonen, T., Franklin, S., Teller, J., Huang, Q., Cavallaro, P., Meza, J., Veeraraghavan, K.: Gorilla: a fast, scalable, in-memory time series database. PVLDB 8(12), 1816–1827 (2015)
Google Scholar
Rhea, S., Wang, E., Wong, E., Atkins, E., Storer, N.: LittleTable: a time-series database and its uses. In: Proceedings of SIGMOD, pp. 125–138. ACM Press, Chicago (2017)
Google Scholar
Sears, R., Ramakrishnan, R.: bLSM: a general purpose log structured merge tree. In: Proceedings of SIGMOD, pp. 217–228. ACM Press, Scottsdale (2012)
Google Scholar
Cai, Y., Tong, H., Fan, W., Ji, P., He, Q.: Facets: fast comprehensive mining of coevolving high-order time series. In: Proceedings of the 21th SIGKDD, pp. 79–88. ACM Press, Sydney (2015)
Google Scholar
Papadopoulos, S., Datta, k., Madden, S., Mattson, T.: The TileDB array data storage manager. In: Proceedings of VLDB, pp. 349–360. Springer Press (2017)
Google Scholar
Jermaine, C., Omiecinski, E., Yee, W.G.: The partitioned exponential file for database storage management. The VLDB J. 16(4), 417–437 (2007)
Article Google Scholar
Eamonn, J.K., Kaushik, C., Sharad, M., Michael, J.P.: Locally adaptive dimensionality reduction for indexing large time series databases. In: Proceedings of SIGMOD, pp. 151–162. ACM Press, California (2001)
Google Scholar
Bassiouni, M.A.: Data compression in scientific and statistical databases. IEEE Trans. Softw. Eng. SE–11(10), 1047–1058 (2006)
Google Scholar
Podlipnig, S., Böszörmenyi, L.: A survey of web cache replacement strategies. ACM Comput. Surv. 35(4), 374–398 (2003)
Article Google Scholar

Download references

Acknowledgments

This work is supported by NSFC program (No. 61472022, 61421003), SKLSDE-2016ZX-11, and the Beijing Advanced Innovation Center for Big Data and Brain Computing.

Author information

Authors and Affiliations

Department of Computer Science, Beihang University, Beijing, China
Chen Li, Jianxin Li, Jinghui Si & Yangyang Zhang

Authors

Chen Li
View author publications
You can also search for this author in PubMed Google Scholar
Jianxin Li
View author publications
You can also search for this author in PubMed Google Scholar
Jinghui Si
View author publications
You can also search for this author in PubMed Google Scholar
Yangyang Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chen Li .

Editor information

Editors and Affiliations

Guangzhou University, Guangzhou, China
Guojun Wang
Edith Kinney Gaylord Presidential Professor, University of Oklahoma, Norman, Oklahoma, USA
Mohammed Atiquzzaman
Aalto University, Espoo, Finland
Zheng Yan
University of Texas at San Antonio, San Antonio, Texas, USA
Kim-Kwang Raymond Choo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, C., Li, J., Si, J., Zhang, Y. (2017). FluteDB: An Efficient and Dependable Time-Series Database Storage Engine. In: Wang, G., Atiquzzaman, M., Yan, Z., Choo, KK. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2017. Lecture Notes in Computer Science(), vol 10658. Springer, Cham. https://doi.org/10.1007/978-3-319-72395-2_41

Download citation

DOI: https://doi.org/10.1007/978-3-319-72395-2_41
Published: 09 December 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-72394-5
Online ISBN: 978-3-319-72395-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics