Abstract
Time-series discord is widely used in data mining applications to characterize anomalous subsequences in time series. Compared to some other discord search algorithms, the direct search algorithm based on the recurrence plot shows the advantage of being fast and parameter free. The direct search algorithm, however, relies on quasi-periodicity in input time series, an assumption that limits the algorithm's applicability. In this paper, we eliminate the periodicity assumption from the direct search algorithm by proposing a reference function for subsequences and a new sampling strategy based on the reference function. These measures result in a new algorithm with improved efficiency and robustness, as evidenced by our empirical evaluation.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Shoeb A, Edwards H, Connolly J et al (2004) Patient-specific seizure onset detection. Epilepsy & Behavior 5(4):483–498
Febrero M, Galeano P, González-Manteiga W (2007) A functional analysis of NOx levels: Location and scale estimation and outlier detection. Computational Statistics 22(3):411–427
van Wijk J, Van Selow E. Cluster and calendar based visualization of time series data. In Proc. the 1999 IEEE Symposium on Information Visualization, Oct. 1999, pp.4–9.
Rebbapragada U, Protopapas P, Brodley C, Alcock C (2009) Finding anomalous periodic time series. Machine learning 74(3):281–313
Rousseeuw P, Leroy A. Robust Regression and Outlier Detection. Wiley Online Library, 1987.
Dasgupta D, Forrest S. Novelty detection in time series data using ideas from immunology. In Proc. the International Conference on Intelligent Systems, Dec. 1996, pp.82–87.
Hyndman R, Ullah S (2007) Robust forecasting of mortality and fertility rates: A functional data approach. Computational Statistics & Data Analysis 51(10):4942–4956
Febrero M, Galeano P, González-Manteiga W (2008) Outlier detection in functional data by depth measures, with application to identify abnormal NOX levels. Environmetrics 19(4):331–345
Keogh E, Lin J, Fu A. HOT SAX: Efficiently finding the most unusual time series subsequence. In Proc. the 5th IEEE International Conference on Data Mining, Nov. 2005, pp.226–233.
Lin J, Keogh E, Fu A, Van Herle H. Approximations to magic: Finding unusual medical time series. In Proc. the 18th IEEE Symposium on Computer-Based Medical Systems, Jun. 2005, pp.329–334.
Bu Y, Leung T, Fu A, Keogh E, Pei J, Meshkin S. WAT: Finding top-K discords in time series database. In Proc. the 7th SIAM International Conference on Data Mining, Apr. 2007.
Yankov D, Keogh E, Rebbapragada U (2008) Disk aware discord discovery: Finding unusual time series in terabyte sized datasets. Knowledge and Information Systems 17(2):241–262
Goldberger A, Amaral L, Glass L et al (2000) Physiobank, physiotoolkit, and physionet: Components of a new research resource for complex physiologic signals. Circulation 101(23):e215–e220
Shoeb A, Guttag J. Application of machine learning to epileptic seizure detection. In Proc. the 27th International Conference on Machine Learning, Jun. 2010, pp.975–982.
Pham N, Le Q, Dang T. HOT aSAX: A novel adaptive symbolic representation for time series discords discovery. In Proc. the 2nd Intelligent Information and Database Systems, Mar. 2010, pp.113–121.
Fu A, Leung O, Keogh E, Lin J. Finding time series discords based on Haar transform. In Proc. the 2nd Advanced Data Mining and Applications, Aug. 2006, pp.31–41.
Shieh J, Keogh E. iSAX: Indexing and mining terabyte sized time series. In Proc. the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2008, pp.623–631.
Son M, Anh D. EWAT+: Finding time series discords based on new discord measure functions. In Proc. IEEE International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future, Nov. 2010, pp.1–4.
Luo W, Gallagher M. Faster and parameter-free discord search in quasi-periodic time series. In Proc. PAKDD, May 2011, pp.135–148.
Eckmann J, Kamphorst S, Ruelle D (1987) Recurrence plots of dynamical systems. Europhysics Letters 4(9):973–977
Iwanski J, Bradley E (1998) Recurrence plots of experimental data: To embed or not to embed? Chaos: An Interdisciplinary Journal of Nonlinear Science 8(4):861–871
Webber Jr C, Zbilut J. Recurrence quantification analysis of nonlinear dynamical systems. Tutorials in Contemporary Nonlinear Methods for the Behavioral Sciences, 2005, pp.26–94.
Marwan N, Carmen Romano MC, Thiel M, Kurths J (2007) Recurrence plots for the analysis of complex systems Physics Reports 438(5/6):237–329
Marwan N (2008) A historical review of recurrence plots. The European Physical Journal-Special Topics 164(1):3–12
Chatfield C. The Analysis of Time Series: An Introduction, (6th edition). Chapman & Hall/CRC, 2004.
Keogh E, Lin J, Fu A. The UCR time series discords. http://www.cs.ucr.edu/~eamonn/discords/, April 2010.
Hyndman R. Time series data library. http://data.is/TSDLdemo, April 2010.
Author information
Authors and Affiliations
Corresponding author
Additional information
Support by Australian Research Council Linkage Grant No. LP 0776417.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Luo, W., Gallagher, M. & Wiles, J. Parameter-Free Search of Time-Series Discord. J. Comput. Sci. Technol. 28, 300–310 (2013). https://doi.org/10.1007/s11390-013-1330-8
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-013-1330-8