ABSTRACT
Network faults are often transient and hence hard to detect and difficult to resolve. Our study conducts an analysis of Kafka's network fault tolerance capabilities, one of the widely used distributed stream processing system (DSPS). Across different Kafka configurations, we observed that Kafka is fault-tolerant towards network faults to some degree, and we report observations of its shortcomings. We also define a network fault-tolerance benchmark on which other DSPSs can be evaluated.
- J. Karimov, T. Rabl, A. Katsifodimos, R. Samarev, H. Heiskanen, and V. Markl, "Benchmarking distributed stream data processing systems," 2018 IEEE 34th International Conference on Data Engineering (ICDE), Apr 2018. [Online]. Google ScholarCross Ref
- M. A. Lopez, A. G. P. Lobato, and O. C. M. B. Duarte, "A performance comparison of open-source stream processing platforms," 2016 IEEE Global Communications Conference (GLOBECOM), pp. 1--6, 2016.Google Scholar
- G. van Dongen and D. V. D. Poel, "A performance analysis of fault recovery in stream processing frameworks," IEEE Access, vol. 9, pp. 93 745--93 763, 2021.Google Scholar
Index Terms
- Benchmarking Apache Kafka under network faults
Comments