skip to main content
10.1145/3698300.3698312acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbdtConference Proceedingsconference-collections
research-article

A Memory Optimization Method for High Cardinality Based on InfluxDB

Published: 14 December 2024 Publication History

Abstract

The time series databases support various monitoring systems and data analysis systems. However, high cardinality may be causing memory issues. when handling a heavy workload, time series databases consume a lot of memory, resulting in a drop in read and write performance. In order to solve this problem, a memory optimization method for high cardinality based on InfluxDB is proposed. This method first reduces the number of time series data in the buffer, that is, by reducing the number of time series data decompressed by InfluxDB from the disk, thereby reducing the memory overhead required for buffering temporary data. Secondly, it delays the reading of non-essential data during the current query phase. By utilizing the data skewness between timelines, the reading of non-essential data blocks can be postponed, reducing the number of timelines present in memory at any given moment and thereby decreasing memory overhead. Experiments show that in high cardinality scenarios with a concurrency of 100,000, the optimized InfluxDB memory overhead is only 23.9% of the original InfluxDB system.

References

[1]
Brillinger D R. Time Series: Data Analysis and Theory[M]. Philadelphia, PA: SIAM, 2001
[2]
InfluxData. InfluxDB time series database[EB/OL]. [2023-04-10]. https://www.influxdata.com
[3]
Kumar, S., Vasthimal, D.K. Cardinality Based Rate Limiting System for Time-Series Data. Cloud Computing – CLOUD 2020. Lecture Notes in Computer Science(pp 250–260), vol 12403. Springer, Cham.
[4]
Véril M, Scemama A, et al. QUESTDB: a database of highly accurate excitation energies for the electronic structure community. WIREs Comput. Mol. Sci. 2021
[5]
Patel A. Evaluating the performance of NoSQL and Time Series databases using TSBS [J]. 2023.
[6]
Douglas, K., and Douglas, S., POSTGRESQL, SAMS Publishing, Indianapolis, IN, 2003.
[7]
Comer D. Ubiquitous B-tree[J]. ACM Computing Surveys, 1979, 11(2): 121−137
[8]
TAOS Data. TDengine is an open source, high-performance, cloud native time-series database[EB/OL]. [2023-04-10]. https://tdengine.com/
[9]
The VictoriaMetrics Team. VictoriaMetrics: The high-performance, open source time series database & monitoring solution[EB/OL]. [2023-04-10]. https://victoriametrics.com/
[10]
VictoriaMetrics. VictoriaMetrics cardinality explorer[EB/OL]. [2023-04-10]. https://victoriametrics.com/blog/cardinality-explorer/
[11]
InfluxData. Time series index (TSI) overview[EB/OL]. [2023-04 10]. https://docs.influxdata.com/influxdb/v1.8/concepts/time-series index/
[12]
O'neil E J, O'neil P E, Weikum G. The LRU-K page replacement algorithm for database disk buffering[J]. ACM SIGMOD Record, 1993, 22(2): 297−306
[13]
Dix P. Announcing InfluxDB IOx - The future core of InfluxDB built with rust and arrow[EB/OL]. [2023-04-10].https://www.influxdata. com/blog/announcing-influxdb-iox/

Index Terms

  1. A Memory Optimization Method for High Cardinality Based on InfluxDB

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICBDT '24: Proceedings of the 2024 7th International Conference on Big Data Technologies
    September 2024
    140 pages
    ISBN:9798400717512
    DOI:10.1145/3698300
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 December 2024

    Check for updates

    Author Tags

    1. High Cardinality
    2. Memory Optimization
    3. Time Series Databases, InfluxDB

    Qualifiers

    • Research-article

    Funding Sources

    • Sichuan Science and Technology Program

    Conference

    ICBDT 2024

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 28
      Total Downloads
    • Downloads (Last 12 months)28
    • Downloads (Last 6 weeks)15
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media