Mining patterns at each scale in massive data

  • Communications Session 1B Learning and Discovery Systems
  • Conference paper
Foundations of Intelligent Systems (ISMIS 1996)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1079)

Abstract

An important but neglected aspect of data analysis is discovering phenomena at different scales in the same data. Scale plays a role analogous to error: it can be used to focus data exploration on differences that exceed the given scale (error) and to disregard smaller ones. We introduce a discovery mechanism that applies to bivariate patterns, in particular to time series. It combines a search for maxima and minima with a search for regularities in the form of equations. If it cannot find a regularity for all the data, it uses other discovered patterns to divide the data into subsets and explores each subset recursively. Detected patterns are subtracted from the data, and the search continues in the residuals. Our mechanism does not skip patterns at any scale. Applied at many scales and to many data sets, it seems explosive, but it terminates surprisingly fast because of data reduction and the requirements of pattern stability and significance. We show an application of our method to a time series of half a million data points. Our example shows that even simple data can reveal many surprising phenomena, and that our method leads to fine conclusions about the environment in which the data were gathered.
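
The idea of treating scale like an error tolerance when searching for maxima and minima can be illustrated with a minimal sketch. This is not the authors' implementation, which the abstract does not specify; the function name `extrema_at_scale` and the sample data are hypothetical, and the approach shown is a generic threshold-based extremum scan. It reports a maximum (or minimum) only once the series has moved away from it by more than `scale`, so swings smaller than the scale are disregarded.

```python
def extrema_at_scale(ys, scale):
    """Return (index, kind) pairs for alternating extrema of ys.

    An extremum is confirmed only after the series departs from it by
    more than `scale`; smaller fluctuations are treated like error.
    Illustrative sketch only, not the method from the paper.
    """
    if not ys:
        return []
    extrema = []
    hi = lo = 0      # indices of the running max and min since the last confirmation
    direction = 0    # 0: unknown, +1: looking for a max, -1: looking for a min
    for i, y in enumerate(ys):
        if y > ys[hi]:
            hi = i
        if y < ys[lo]:
            lo = i
        if direction >= 0 and ys[hi] - y > scale:
            # Dropped more than `scale` below the running max: confirm it.
            extrema.append((hi, "max"))
            direction = -1
            hi = lo = i
        elif direction <= 0 and y - ys[lo] > scale:
            # Rose more than `scale` above the running min: confirm it.
            extrema.append((lo, "min"))
            direction = 1
            hi = lo = i
    return extrema


# Hypothetical sample series with both large and small swings.
series = [0, 1, 50, 49, 51, 0, 2, -1, 60]
coarse = extrema_at_scale(series, scale=10)
fine = extrema_at_scale(series, scale=1.5)
```

Running the scan at a coarse scale and then at a finer one on the same series shows how lowering the scale exposes additional extrema that the coarse pass treats as noise.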

Author information

Corresponding author

Correspondence to Jan M. Żytkow.

Editor information

Zbigniew W. Raś, Maciek Michalewicz

Copyright information

© 1996 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Żytkow, J.M., Zembowicz, R. (1996). Mining patterns at each scale in massive data. In: Raś, Z.W., Michalewicz, M. (eds) Foundations of Intelligent Systems. ISMIS 1996. Lecture Notes in Computer Science, vol 1079. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61286-6_139

  • DOI: https://doi.org/10.1007/3-540-61286-6_139

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-61286-5

  • Online ISBN: 978-3-540-68440-4

  • eBook Packages: Springer Book Archive
