Abstract
Today many internet messaging platforms are built based on publish-subscribe event delivery models that needs to deliver large amounts of messages to ubiquitous clients in a timely manner. To deal with the huge amount of messages reliably, fault tolerance and disaster recovery, e.g. power outages, network interruptions, and software failures, have become vital issues. This paper studies the issue of disaster recovery for the popular Apache Kafka message delivery system. We adopt the strategy of cooperative redundant among multi-region Kafka clusters. By using this approach, the internet messaging applications connect to a local Kafka consumer gateway which uses multi-threading to connect multiple Kafka clusters in different regions. The gateway automatically selects active cluster thereby achieves location transparency and fault tolerance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Apache Kafka. https://kafka.apache.org/documentation/
Active-active and active-passive failover. https://docs.aws.amazon.com/Route53/latest/DeveloperGuide/dns-failover-types.html
Active/Active Architecture. https://docs.cloudera.com/csp/2.0.1/srm-overview/topics/srm-active-active-arch.html
Soman, C.: uReplicator: Uber Engineering’s Robust Apache Kafka Replicator (2022). https://eng.uber.com/ureplicator-apache-kafka-replicator/
Apache Kafka's MirrorMaker. https://docs.confluent.io/4.0.0/multi-dc/mirrormaker.html
uReplicater. https://www.163.com/dy/article/G1E7A18A0511D3QS.html
Acknowledgments
This study is financial support in part by Ministry of Science and Technology, Taiwan under the grants MOST 108-2221-E-029-009 and MOST 109-2221-E-029-017-MY2.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, LP., Yei, LF., Chen, YR. (2022). An Efficient Disaster Recovery Mechanism for Multi-region Apache Kafka Clusters. In: Barolli, L. (eds) Innovative Mobile and Internet Services in Ubiquitous Computing. IMIS 2022. Lecture Notes in Networks and Systems, vol 496. Springer, Cham. https://doi.org/10.1007/978-3-031-08819-3_31
Download citation
DOI: https://doi.org/10.1007/978-3-031-08819-3_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08818-6
Online ISBN: 978-3-031-08819-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)