ABSTRACT
Borealis-R is a replication-based system for both fast and highly-available processing of data streams over wide-area networks. In Borealis-R, multiple operator replicas send outputs to downstream replicas, allowing each replica to use whichever data arrives first. To further reduce latency, replicas run without coordination, possibly processing data in different orders. Despite this flexibility, Borealis-R guarantees that applications always receive the same results as in the non-replicated, failure-free case. In addition, Borealis-R deploys replicas at select network locations to effectively improve performance as well as availability.
We demonstrate the strengths of Borealis-R using a live wide-area monitoring application. We show that Borealis-R outperforms previous solutions in terms of latency and that it uses system resources efficiently by carefully deploying and discarding replicas.
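The "use whichever data arrives first" idea described in the abstract can be illustrated with a minimal Python sketch. It is not the Borealis-R implementation: the function name `first_arrival_merge`, the replica labels, and the assumption that every tuple carries a unique identifier shared across replicas are all hypothetical, introduced only for illustration.

```python
from typing import Hashable, Iterable, Iterator, Tuple

# One arrival at a downstream replica: (upstream replica name, tuple id, payload).
# The shared tuple id is an assumption of this sketch, not a documented
# Borealis-R mechanism.
Arrival = Tuple[str, Hashable, str]


def first_arrival_merge(arrivals: Iterable[Arrival]) -> Iterator[Arrival]:
    """Yield each tuple once, taken from whichever replica delivered it first."""
    seen = set()  # ids already forwarded; unbounded here for simplicity
    for replica, tuple_id, payload in arrivals:
        if tuple_id in seen:
            continue  # a slower replica's copy of a tuple we already used
        seen.add(tuple_id)
        yield replica, tuple_id, payload


# Hypothetical arrival order at one downstream replica.
arrivals = [
    ("replica_a", 1, "temp=20"),
    ("replica_b", 1, "temp=20"),  # duplicate, dropped
    ("replica_b", 2, "temp=21"),  # replica_b is faster for this tuple
    ("replica_a", 2, "temp=21"),  # duplicate, dropped
    ("replica_a", 3, "temp=19"),
]

merged = list(first_arrival_merge(arrivals))
for replica, tuple_id, payload in merged:
    print(tuple_id, payload, "via", replica)
```

The sketch covers only duplicate elimination at the receiver. It does not model how results stay identical when replicas process data in different orders, and a long-running system would also need to bound the set of remembered identifiers.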
Index Terms
- Borealis-R: a replication-transparent stream processing system for wide-area monitoring applications