Abstract
In this paper, we give a general treatment for mining some kinds of sequences such as customer sequences, document sequences, and DNA sequences. Large collections of transaction, document, and genomic information have been accumulated in recent years, and embedded latently in it there is potentially significant knowledge for exploitation in the retailing industry, in information retrieval, and in medicine and the pharmaceutical industry, respectively. The approach taken here to the distillation of such knowledge is to detect strings in sequences which appear frequently, either within a given sequence (e.g. for a particular customer, document, or patient) or across sequences (e.g. from different customers, documents, or patients sharing a particular transaction, information retrieval, or medical diagnosis; respectively). Patterns are strings that occur very frequently. These concepts are generalisation of the concept of motifs for DNA sequences. There are interesting differences between the 3 applications.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
Author information
Authors and Affiliations
Editor information
Rights and permissions
About this chapter
Cite this chapter
Guan, J., Bell, D., Liu, D. Data Mining for Maximal Frequent Patterns in Sequence Groups. In: Ruan, D., Chen, G., E. Kerre, E., Wets, G. (eds) Intelligent Data Mining. Studies in Computational Intelligence, vol 5. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11004011_7
Download citation
DOI: https://doi.org/10.1007/11004011_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26256-5
Online ISBN: 978-3-540-32407-2
eBook Packages: EngineeringEngineering (R0)