Mining patterns at each scale in massive data

Żytkow, Jan M.; Zembowicz, Robert

doi:10.1007/3-540-61286-6_139

Jan M. Żytkow¹^nAff2 &
Robert Zembowicz¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1079))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

151 Accesses

Abstract

An important but neglected aspect of data analysis is discovering phenomena at different scale in the same data. Scale plays the role analogous to error. It can be used to focus data exploration on differences that exceed the given scale (error) and to disregard those smaller. We introduce a discovery mechanism that applies to bi-variate patterns, in particular to time series. It combines search for maxima and minima with search for regularities in the form of equations. If it cannot find a regularity for all data, it uses other discovered patterns to divide data into subsets, and explores recursively each subset. Detected patterns are subtracted from data and the search continues in the residuals. Our mechanism does not skip patterns at any scale. Applied at many scales and to many data sets, it seems explosive, but it terminates surprisingly fast because of data reduction and the requirements of pattern stability and significance. We show application of our method on a time series of a half million datapoints. Our example shows that even simple data can reveal many surprising phenomena, and our method leads to fine conclusions about the environment in which they have been gathered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Langley, P., Simon, H., Bradshaw, G. & Zytkow, J. 1987. Scientific Discovery: Computational Explorations of the Creative Processes. MIT Press.
Google Scholar
Witkin, A.P. 1983. Scale-Space Filtering, in Proc. of Intl. Joint Conf. on Artificial Intelligence (IJCAI-83), AAAI Press, p.1019–1022.
Google Scholar
Zembowicz, R., & Żytkow, J.M. 1992. Discovery of Equations: Experimental Evaluation of Convergence, in Proc. of AAAI-92, AAAI Press, p.70–75.
Google Scholar
Żytkow, J.M. 1996. Automated Discovery of Empirical Laws, to appear in Fundamenta Informaticae.
Google Scholar

Download references

Author information

Jan M. Żytkow
Present address: Institute of Computer Science, Polish Academy of Sciences, Warsaw

Authors and Affiliations

Computer Science Department, Wichita State University, 67260-0083, Wichita, KS
Jan M. Żytkow & Robert Zembowicz

Authors

Jan M. Żytkow
View author publications
You can also search for this author in PubMed Google Scholar
Robert Zembowicz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jan M. Żytkow .

Editor information

Zbigniew W. Raś Maciek Michalewicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Żytkow, J.M., Zembowicz, R. (1996). Mining patterns at each scale in massive data. In: Raś, Z.W., Michalewicz, M. (eds) Foundations of Intelligent Systems. ISMIS 1996. Lecture Notes in Computer Science, vol 1079. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61286-6_139

Download citation

DOI: https://doi.org/10.1007/3-540-61286-6_139
Published: 01 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61286-5
Online ISBN: 978-3-540-68440-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics