Skip to main content

Data Mining for Maximal Frequent Patterns in Sequence Groups

  • Chapter
  • First Online:
Intelligent Data Mining

Part of the book series: Studies in Computational Intelligence ((SCI,volume 5))

Abstract

In this paper, we give a general treatment for mining some kinds of sequences such as customer sequences, document sequences, and DNA sequences. Large collections of transaction, document, and genomic information have been accumulated in recent years, and embedded latently in it there is potentially significant knowledge for exploitation in the retailing industry, in information retrieval, and in medicine and the pharmaceutical industry, respectively. The approach taken here to the distillation of such knowledge is to detect strings in sequences which appear frequently, either within a given sequence (e.g. for a particular customer, document, or patient) or across sequences (e.g. from different customers, documents, or patients sharing a particular transaction, information retrieval, or medical diagnosis; respectively). Patterns are strings that occur very frequently. These concepts are generalisation of the concept of motifs for DNA sequences. There are interesting differences between the 3 applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Editor information

Da Ruan Guoqing Chen Etienne E. Kerre Geert Wets

Rights and permissions

Reprints and permissions

About this chapter

Cite this chapter

Guan, J., Bell, D., Liu, D. Data Mining for Maximal Frequent Patterns in Sequence Groups. In: Ruan, D., Chen, G., E. Kerre, E., Wets, G. (eds) Intelligent Data Mining. Studies in Computational Intelligence, vol 5. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11004011_7

Download citation

  • DOI: https://doi.org/10.1007/11004011_7

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26256-5

  • Online ISBN: 978-3-540-32407-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics