Definitions
The management of time in streaming systems refers to how streaming computations make use of the explicit and implicit times associated with messages used in the computation.
Background
In a streaming data system, the data consists of messages that enter the system and then undergo various operations such as filtering, transformation, and aggregation. Between processing steps, these messages are transported by message streams such as those provided by Apache Kafka (Apache Software Foundation 2016c). When messages come from disparate sources via different paths, they may be reordered, delayed, or even lost entirely due to propagation delay or hardware or software failures. Processing steps themselves may also delay, reorder, or lose messages due to multi-threading or various kinds of faults in processing.
It is desirable that the processing of these messages be as reliable and timely as possible, but delay, loss, and reordering of messages can make this difficult or even...
References
Akidau T, Bradshaw R, Chambers C, Chernyak S, Fernández-Moctezuma RJ, Lax R, McVeety S, Mills D, Perry F, Schmidt E, Whittle S (2015) The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing. In: Proceedings of the VLDB endowment, vol 8, pp 1792–1803
Apache Software Foundation (2013) Storm. http://storm.apache.org
Apache Software Foundation (2016a) Beam. http://beam.apache.org
Apache Software Foundation (2016b) Flink. http://flink.apache.org
Apache Software Foundation (2016c) Kafka. http://kafka.apache.org
Apache Software Foundation (2016d) Spark structured streaming. https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html
Friedman E, Tzoumas K (2016) Introduction to Apache flink: stream processing for real time and beyond. O’Reilly Media, Inc., Sebastopol
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this entry
Cite this entry
Dunning, T. (2018). Management of Time. In: Sakr, S., Zomaya, A. (eds) Encyclopedia of Big Data Technologies. Springer, Cham. https://doi.org/10.1007/978-3-319-63962-8_190-1
Download citation
DOI: https://doi.org/10.1007/978-3-319-63962-8_190-1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63962-8
Online ISBN: 978-3-319-63962-8
eBook Packages: Living Reference MathematicsReference Module Computer Science and Engineering