Abstract
While traditional water data stream analysis focuses mainly on single sensor node or monitoring station, having an accurate picture of the overall data patterns is more meaningful in understanding large water distribution network’s behavior and characteristics, tracking important trends, and also making informed judgments about measurement or utilization operations. In this paper, we propose a continuous summarization scheme that aims to continuously provide Representative Patterns of the complete data in large water distribution network. Our core contributions are to propose to summarize Representative Pattern for describing the spatial-temporal pattern in water distribution network and employ a parameter-free algorithm based on the Minimum Description Length (MDL) Principle to automatically split data streams into episodes for generating the series of representative patterns. Moreover, we evaluate our approaches on a real water distribution network from the Battle of the Water Sensor Network (BWSN). Experiment results show that our online summarization methods are effective, scalable and interpretable; What’s more, we discover interesting periodic time-evolving patterns on the chlorine data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ostfeld, A., Uber, J.G., Salomons, E.: Battle of water sensor networks: A design challenge for engineers and algorithms. In: WDSA (2006)
Rossman, L.A.: EPANET2 user’s manual. National Risk Management Research Laboratory: U.S. Environmental Protection Agency (2000)
Hart, D.B., Klise, K.A., McKenna, S.A., Wilson, M.P.: CANARY User’s Manual Version 4.1. Sandia National Laboratories. U.S. Environmental Protection Agency (2009)
Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. TKDD 1(1) (2007)
Ester, M., Peter Kriegel, H., Sander, J.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD 1996 (1996)
Rissanen, J.: Modeling by the shortest data description. Automatica 14, 465–471 (1978)
Sun, J., Papadimitriou, S., Yu, P.S., Faloutsos, C.: Graphscope: Parameter-free mining of large time-evolving graphs. In: KDD (2007)
Papadimitriou, S., Sun, J.: Streaming Pattern Discovery in Multiple Time-Series. In: VLDB 2005 (2005)
Sun, J., Tao, D.: Beyond streams and graphs: dynamic tensor analysis. In: KDD (2006)
Silja, T., Helena, R., Didia, C.: Water Supply System Performance for Different Pipe Materials Part I: Water Quality Analysis. Water Resour. Manage. (2008)
Zhu, Y., Shasha, D.: Statstream: Statistical monitoring of thousands of data streams in real time. In: VLDB (2002)
Sakurai, Y., Papadimitriou, S., Faloutsos, C.: Braid: Stream mining through group lag correlations. In: SIGMOD (2005)
Lin, J., Vlachos, M., Keogh, E., Gunopulos, D.: Iterative incremental clustering of time series. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K., Ferrari, E. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 106–122. Springer, Heidelberg (2004)
Li, L., McCann, J., Pollard, N., Faloutsos, C.: DynaMMo: Mining and Summarization of
Johnson, D.S., Krishnan, S., Chhugani, J., Kumar, S., Venkatasubramanian, S.: Compressing large Boolean matrices using reordering techniques. In: VLDB (2004)
Muthukrishnan, S.: Data Streams: Algorithms and Applications (Foundations and Trends in Theoretical Computer Science), vol. 1 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xiao, H., Ma, X., Tang, S., Tian, C. (2010). Continuous Summarization of Co-evolving Data in Large Water Distribution Network. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds) Web-Age Information Management. WAIM 2010. Lecture Notes in Computer Science, vol 6184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14246-8_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-14246-8_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14245-1
Online ISBN: 978-3-642-14246-8
eBook Packages: Computer ScienceComputer Science (R0)