Processing Sequential Patterns in Relational Databases

Shang, Xuequn; Sattler, Kai-Uwe

doi:10.1007/11546849_43


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3589)

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

Abstract

Database integration of data mining has gained popularity, and its significance is well recognized. However, the performance of SQL-based data mining is known to fall behind that of specialized implementations, owing to the prohibitive cost of extracting knowledge and the lack of suitable declarative query language support. Recent studies have found that, for association rule mining and sequential pattern mining, carefully tuned SQL formulations can achieve performance comparable to systems that cache the data in files outside the DBMS. However, most previous pattern mining methods follow the Apriori approach, which still encounters problems when the sequential database is large and/or when the sequential patterns to be mined are numerous and long.

In this paper, we present a novel SQL-based approach that we recently proposed, called Prospad (PROjection Sequential PAttern Discovery). Prospad differs fundamentally from the Apriori-like candidate generation-and-test approach: it is a pattern-growth approach without candidate generation. It grows longer patterns from shorter ones by successively projecting the sequential table into subsequential tables. Since the projected table for a sequential pattern i contains all and only the information necessary for mining the sequential patterns that can grow from i, the size of the projected table usually shrinks quickly as mining proceeds to longer patterns. Moreover, a depth-first approach is used to facilitate the projection process and to avoid the cost of creating and dropping some temporary tables.
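The projection idea can be illustrated with a minimal sketch. It is not the authors' Prospad SQL formulation, which the abstract does not give. It assumes a simplified sequential table `seq(sid, pos, item)` in which every event holds a single item, uses Python's `sqlite3` module as a stand-in DBMS, and reuses one temporary table per recursion depth as one plausible reading of the depth-first remark. The names `mine_patterns`, `demo`, `seq`, and `proj_<depth>` are hypothetical.

```python
import sqlite3


def mine_patterns(conn, table, prefix, min_sup, out, depth=0):
    """Depth-first pattern growth over a (sid, pos, item) table.

    Assumption: each event is a single item, so a pattern is a tuple of items.
    """
    # Frequent items in the current (projected) table: counted once per sequence.
    frequent = conn.execute(
        f"SELECT item, COUNT(DISTINCT sid) FROM {table} "
        "GROUP BY item HAVING COUNT(DISTINCT sid) >= ? ORDER BY item",
        (min_sup,),
    ).fetchall()

    # One temporary table per depth, emptied and refilled instead of
    # being dropped and recreated for every pattern (an assumption).
    proj = f"proj_{depth}"
    conn.execute(
        f"CREATE TEMP TABLE IF NOT EXISTS {proj} "
        "(sid INTEGER, pos INTEGER, item TEXT)"
    )

    for item, support in frequent:
        pattern = prefix + (item,)
        out[pattern] = support

        # Projection: keep only the rows that follow the first
        # occurrence of `item` in each sequence.
        conn.execute(f"DELETE FROM {proj}")
        conn.execute(
            f"INSERT INTO {proj} "
            f"SELECT t.sid, t.pos, t.item FROM {table} AS t "
            f"JOIN (SELECT sid, MIN(pos) AS first_pos FROM {table} "
            f"      WHERE item = ? GROUP BY sid) AS f ON t.sid = f.sid "
            f"WHERE t.pos > f.first_pos",
            (item,),
        )
        # Grow longer patterns from the projected table only.
        mine_patterns(conn, proj, pattern, min_sup, out, depth + 1)


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE seq (sid INTEGER, pos INTEGER, item TEXT)")
    sequences = {1: "abc", 2: "acb", 3: "ab"}  # toy data, not from the paper
    rows = [
        (sid, pos, item)
        for sid, items in sequences.items()
        for pos, item in enumerate(items)
    ]
    conn.executemany("INSERT INTO seq VALUES (?, ?, ?)", rows)
    out = {}
    mine_patterns(conn, "seq", (), 2, out)
    conn.close()
    return out


if __name__ == "__main__":
    for pattern, support in demo().items():
        print(pattern, support)
```

On the toy data with a minimum support of 2, the sketch finds the patterns a, b, c, a→b, and a→c. No candidate set is generated at any point: each recursive call only counts items in a table that has already been narrowed to the suffixes following the current pattern.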



Author information

Authors and Affiliations

Xuequn Shang: Department of Computer Science, University of Magdeburg, P.O. Box 4120, 39106 Magdeburg, Germany
Kai-Uwe Sattler: Department of Computer Science and Automation, Technical University of Ilmenau


Editor information

Editors and Affiliations

Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstr. 9-11/188, A-1040, Wien, Austria
A Min Tjoa
Department of Software and Computing Systems, University of Alicante, Spain
Juan Trujillo


Cite this paper

Shang, X., Sattler, K.-U. (2005). Processing Sequential Patterns in Relational Databases. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2005. Lecture Notes in Computer Science, vol 3589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11546849_43


DOI: https://doi.org/10.1007/11546849_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28558-8
Online ISBN: 978-3-540-31732-6
eBook Packages: Computer Science, Computer Science (R0)
