Abstract
The focus of this paper is anomaly detection and pattern discovery in spatio-temporal tensor streams. As an example, sensor networks comprising of multiple individual sensor streams generate spatio-temporal data, which can be captured in tensor streams. Anomaly detection in such data is considered challenging because of the potential complexity and high order of the tensor data from spatio-temporal sources such as sensor networks. In this paper, we propose an innovative approach for anomaly detection and pattern discovery in such tensor streams. We model the tensor stream itself as a single incremental tensor, for example representing the entire sensor network, instead of dealing with each individual tensor in the stream separately. Such a model provides a global view of the tensor stream and enables subsequent in-depth analysis of it. The proposed approach is designed for online analysis of tensor streams with fast runtime. We evaluate our approach for detecting anomalies under different conditions and for identifying complex data patterns. We also compare the proposed approach with the existing tensor stream analysis method (Sun et al. in ACM Trans Knowl Discov Data 2, 2008). Our evaluation uses synthetic data as well as real-world data showing the efficiency and effectiveness of the proposed approach.







Similar content being viewed by others
References
Aggarwal CC (2005) On abnormality detection in spuriously populated data streams. In: SDM’05, p 1
Ahmed T, Coates M, Lakhina A (2007) Multivariate online anomaly detection using kernel recursive least squares. In: Proceedings IEEE Infocom
Babcock B, Babu S, Datar M, Motwani R, Widom J (2002) Models and issues in data stream systems. In: Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems, PODS ’02. ACM, New York, pp 1–16
Bader BW, Harshman RA, Kolda TG (2007) Temporal analysis of semantic graphs using ASALSAN. In: ‘ICDM 2007: Proceedings of the 7th IEEE international conference on data mining, pp 33–42
Budhaditya S, Pham D-S, Lazarescu M, Venkatesh S (2009) Effective anomaly detection in sensor networks data streams. In: Proceedings of the 2009 ninth IEEE international conference on data mining, ICDM ’09. IEEE Computer Society, Washington, DC, USA, pp 722–727
CATT Lab traffic databases, http://www.cattlab.umd.edu/ (n.d.)
Chew PA, Bader BW, Kolda TG, Abdelali A (2007) Cross-language information retrieval using PARAFAC2. In: KDD ’07: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining. ACM Press, pp 143–152
DARPA Intrusion Detection Data Sets from MIT Lincoln Laboratory, http://www.ll.mit.edu/mission/communications/cyber/CSTcorpora/ideval/data/ (n.d.)
Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci 41:391–407
Dey S, Janeja VP, Gangopadhyay A (2009) Temporal neighborhood discovery through unequal depth binning. In: IEEE international conference on data mining (ICDM’09)
Huang H, Ding C, Luo D, Li T (2008) Simultaneous tensor subspace selection and clustering: the equivalence of high order svd and k-means clustering. In: KDD ’08: Proceeding of the 14th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, NY, USA, pp 327–335
Janeja VP, Adam NR, Atluri V, Vaidya J (2010) Spatial neighborhood based anomaly detection in sensor datasets. Data Min Knowl Discov 20:221–258
Kolda TG, Bader BW, Kenny JP (2005) Higher-order web link analysis using multilinear algebra. In: Proceedings of IEEE Computer Society, IEEE International Conference Data Mining, pp 242–249
Kolda TG, Bader BW (2009) Tensor decompositions and applications. SIAM Rev 51(3):455–500
Lathauwer LD, Moor BD, Vandewalle J (2000) A multilinear singular value decomposition. SIAM J Matrix Anal Appl 21:1253–1278
Muthukrishnan S (2005) Data streams: algorithms and applications. Found Trends Theor Comput Sci 1:117–236
Papadimitriou S, Sun J, Faloutsos C (2005) Streaming pattern discovery in multiple time-series. In: Proceedings of the 31st international conference on very large data bases, VLDB ’05, VLDB endowment, pp 697–708
Pokrajac D, Lazarevic A, Latecki LJ (2007) Incremental local outlier detection for data streams. In: CIDM’07, pp 504–515
Shashua A, Levin A (2001) Linear image coding for regression and classification using the tensor-rank principle. In: IEEE Conference on computer vision and pattern recognition
Sun J, Papadimitriou S, Yu PS (2006) Window-based tensor analysis on high-dimensional and multi-aspect streams. In: IEEE international conference on data mining, pp 1076–1080
Sun J, Tao D, Papadimitriou S, Yu PS, Faloutsos C (2008) Incremental tensor analysis: theory and applications. ACM Trans Knowl Discov Data 2(3):1–37. doi:10.1145/1409620.1409621
Tucker L (1966) Some mathematical notes on three-mode factor analysis. Psychometrika 31(3):279–311
Vasilescu MAO, Terzopoulos D (2002) Multilinear image analysis for facial recognition. Int Conf Pattern Recognit 2:20511
Wang H, Ahuja N (2003) Facial expression decomposition. In: ICCV, pp 958–965
Zha H, Simon HD (1999) On updating problems in latent semantic indexing. SIAM J Sci Comput 21(2):782–791
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Shi, L., Gangopadhyay, A. & Janeja, V.P. STenSr: Spatio-temporal tensor streams for anomaly detection and pattern discovery. Knowl Inf Syst 43, 333–353 (2015). https://doi.org/10.1007/s10115-014-0733-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-014-0733-3