Abstract
The technique of classification can sort data into various categories for data mining studies. The demand for sequence data classification has increased with the development of information technology. Several applications involve decision prediction based on sequence data, but the traditional classification methods are unsuitable for sequence data. Thus, this paper proposes a Pattern Coverage Rate-based Sequence Classification Model (PCRSCM) to integrate sequential pattern mining and classification techniques. PCRSCM mines sequential patterns to find characteristics of each class, and then calculates pattern coverage rates and class scores to predict the class of a sequence. The experimental results show that PCRSCM exhibits excellent prediction performance on synthetic and real sequence data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhao, Y., Zhang, H., Wu, S., Pei, J., Cao, L., Zhang, C., Bohlscheid, H.: Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns. In: Buntine, W., Grobelnik, M., Mladenić, D., Shawe-Taylor, J. (eds.) ECML PKDD 2009, Part II. LNCS, vol. 5782, pp. 648–663. Springer, Heidelberg (2009)
Exarchos, T.P., Tsipouras, M.G., Papaloukas, C., Fotiadis, D.I.: A two-stage methodology for sequence classification based on sequential pattern mining and optimization. Data & Knowledge Engineering 66, 467–487 (2008)
Papageorgiou, D.G., Demetropoulos, I.N., Lagaris, I.E.: MERLIN-3.1.1. A new version of the Merlin optimization environment. Computer Physics Communications 159, 70–71 (2004)
Li, I.H., Lin, M.C., Liao, I.E.: A Sequential Pattern Length Based Sequence Classifier Model. In: International Conference on Information Management, p. 95 (2011)
Saneifar, H., Bringay, S., Laurent, A., Teisseire, M.: S2mp: Similarity Measure for Sequential Patterns. In: Proc. of 7th Australasian Data Mining Conference, pp. 95–104 (2008)
IBM Quest Market-Basket Synthetic Data Generator, http://www.cs.rpi.edu/~zaki/software/IBM-datagen.tar.gz
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, IH., Huang, JY., Liao, IE., Lin, JH. (2013). A Sequence Classification Model Based on Pattern Coverage Rate. In: Park, J.J.(.H., Arabnia, H.R., Kim, C., Shi, W., Gil, JM. (eds) Grid and Pervasive Computing. GPC 2013. Lecture Notes in Computer Science, vol 7861. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38027-3_81
Download citation
DOI: https://doi.org/10.1007/978-3-642-38027-3_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38026-6
Online ISBN: 978-3-642-38027-3
eBook Packages: Computer ScienceComputer Science (R0)