Abstract
An important but neglected aspect of data analysis is discovering phenomena at different scale in the same data. Scale plays the role analogous to error. It can be used to focus data exploration on differences that exceed the given scale (error) and to disregard those smaller. We introduce a discovery mechanism that applies to bi-variate patterns, in particular to time series. It combines search for maxima and minima with search for regularities in the form of equations. If it cannot find a regularity for all data, it uses other discovered patterns to divide data into subsets, and explores recursively each subset. Detected patterns are subtracted from data and the search continues in the residuals. Our mechanism does not skip patterns at any scale. Applied at many scales and to many data sets, it seems explosive, but it terminates surprisingly fast because of data reduction and the requirements of pattern stability and significance. We show application of our method on a time series of a half million datapoints. Our example shows that even simple data can reveal many surprising phenomena, and our method leads to fine conclusions about the environment in which they have been gathered.
Preview
Unable to display preview. Download preview PDF.
References
Langley, P., Simon, H., Bradshaw, G. & Zytkow, J. 1987. Scientific Discovery: Computational Explorations of the Creative Processes. MIT Press.
Witkin, A.P. 1983. Scale-Space Filtering, in Proc. of Intl. Joint Conf. on Artificial Intelligence (IJCAI-83), AAAI Press, p.1019–1022.
Zembowicz, R., & Żytkow, J.M. 1992. Discovery of Equations: Experimental Evaluation of Convergence, in Proc. of AAAI-92, AAAI Press, p.70–75.
Żytkow, J.M. 1996. Automated Discovery of Empirical Laws, to appear in Fundamenta Informaticae.
Author information
Authors and Affiliations
Corresponding author
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Żytkow, J.M., Zembowicz, R. (1996). Mining patterns at each scale in massive data. In: Raś, Z.W., Michalewicz, M. (eds) Foundations of Intelligent Systems. ISMIS 1996. Lecture Notes in Computer Science, vol 1079. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61286-6_139
Download citation
DOI: https://doi.org/10.1007/3-540-61286-6_139
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61286-5
Online ISBN: 978-3-540-68440-4
eBook Packages: Springer Book Archive