Abstract
Customer transactions tend to change over time with changing customer behaviour patterns. Classifier models, however, are often designed to perform prediction on data which is assumed to be static. These classifier models thus deteriorate in performance over time when predicting in the context of evolving data. Robust adaptive classification models are therefore needed to detect and adjust to the kind of changes that are common in transactional data. This paper presents an investigation into using change mining to monitor the adaptive classification of customers based on their transactions through moving time windows. The classification performance of two-class decision tree ensembles built using the data binning process based on the number of items purchased was monitored over varying 3, 6, 9 and 12 months time windows. The changing class values of the customer profiles were analysed and described. Results from our experiments show that the proposed approach can be used for learning and adapting to changing customer profiles.
Similar content being viewed by others
Notes
The learning process alluded to here is the “learning” of the class of the customer profile as returned by the static classifier in each of the varying time windows.
Reference
Adomavicius G, Tuzhilin A (2011) Context-aware recommender systems. In: Ricci F, Rokach L, Shapira B, Kantor PB (eds) Context-aware recommender systems. Springer, Berlin, pp 217–253
Apeh E, Gabrys B (2011) Change mining of customer profiles based on transactional data. In: Proceedings of the 11th IEEE international conference on data mining workshops (ICDMW 2011). IEEE
Apeh E, Gabrys B, Schierz A (2011) Customer profile classification using transactional data. In: Proceedings of the third world congress on nature and biologically inspired computing (NaBIC2011). IEEE
Battiti R (1994) Using mutual information for selecting features in supervised neural net learning. IEEE Trans Neural Netw 5(4):537–550
Basik M, Feltes D (1999) Defining your customer profile—an essential tool. J Ext (online), 37(6). Available at: http://www.joe.org/joe/1999december/a4.html
Berry MJA, Linoff GS (2000) Mastering data mining: the art and science of customer relationship management. Wiley, Hoboken
Berry MJA, Linoff GS (2004) Data mining techniques: for marketing, sales, and customer relationship management. Wiley, Hoboken
Böttcher M (2011) Contrast and change mining. Wiley Interdisc Rew: Data Mining and Knowledge Discovery 1(3):215–230
Böttcher M, Höppner F, Spiliopoulou M (2008) On exploiting the power of time in data mining. SIGKDD Explor Newsl 10:3–11
Böttcher M, Ru G, Nauck D, Kruse R (2009) Post-mining of association rules: techniques for effective knowledge extraction, IGI Global, chap from change mining to relevance feedback: a unified view on assessing rule interestingness, pp 12–37
Böttcher M, Spott M, Nauck D, Kruse R (2009) Mining changing customer segments in dynamic markets. Expert Syst Appl 36:155–164
Böttcher M, Nauck D, Borgelt C, Kruse R (2010) Temporal aspects in data mining. In: WCCI 2010 Plenary and Invited Lectures, Institute of Electrical and Electronics Engineering, Inc., pp 1–22
Catlett J (1991) On changing continuous attributes into ordered discrete attributes. In: Proceedings of the European working session on learning on machine learning, Springer, New York, pp 164–178
Chiu DKY, Wong AKC, Chan KCC (1991) Synthesis of statistical knowledge from time-dependent data. IEEE Trans Pattern Anal Mach Intell 13:265–271
Chmielewski MR, Grzymala-Busse JW (1995) Global discretization of continuous attributes as preprocessing for machine learning. In: Proceedings of the Third International Workshop on Rough Sets and Soft Computing, San Jose, pp 474–480
Church KW, Li P, Hastie TJ (2006) Conditional random sampling: a sketch-based sampling technique for sparse data. In: In NIPS, pp 873–880
Ester M, Kriegel HP, Sander J, Wimmer M, Xu X (1998) Incremental clustering for mining in a data warehousing environment. In: Proceedings of the 24rd international conference on very large data bases. Morgan Kaufmann Publishers Inc., San Francisco, CA, VLDB ’98, pp 323–333
Gemulla R (2008) Sampling algorithms for evolving datasets. PhD thesis, Technische UniversitSt Dresden
Gemulla R, Lehner W (2008) Sampling time-based sliding windows in bounded space. In: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, ACM, New York, SIGMOD ’08, pp 379–392
Giannotti F, Gozzi C, Manco G (2002) Clustering transactional data. In: Proceedings of the 6th European conference on principles of data mining and knowledge discovery, Springer, London, PKDD ’02, pp 175–187
Günther C, Rinderle S, Reichert M, van der Aalst W (2006) Change mining in adaptive process management systems. On the move to meaningful internet systems 2006: CoopIS, DOA, GADA, and ODBASE pp 309–326
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. SIGKDD Explor Newsl 11:10–18
Han J, Kamber M (2006) Data Mining, 2nd edn. Morgan Kaufmann, San Francisco
Holte RC (1993) Very simple classification rules perform well on most commonly used datasets. Mach Learn 11:63–90
Jin P, Zhu Y (2007) Mining customer change model based on swarm intelligence. In: Proceedings of the 3rd international conference on intelligent computing: advanced intelligent computing theories and applications. With aspects of artificial intelligence, ICIC ’07, Springer, Berlin, pp 456–464
Kerber R (1992) Chimerge: discretization of numeric attributes. In: Proceedings of the ninth international conference on artificial intelligence, pp 123–128
Klinkenberg R (2004) Learning drifting concepts: example selection vs. example weighting. Intell Data Anal 8:281–300
Kohavi R, Sahami M (1996) Error-based and entropy-based discretization of continuous features. In: Proceedings of the 13th international joint conference on artificial intelligence, pp 1022–1027
Kruse R, Steinbrecher M, Moewes C (2010) Temporal pattern mining. In: International conference on signals and electronic systems (ICSES), pp 3–8
Kuncheva L, Bezdek JC, Duin R (2001) Decision templates for multiple classifier fusion: an experimental comparison. Pattern Recogn 34(2):299–314
Li RP, Wang ZO (2002) An entropy-based discretization method for classification rules with inconsistency checking. In: Proceedings of the international conference on machine learning and cybernetics, vol 1, pp 243–246
Liu X, Wang H (2005) A discretization algorithm based on a heterogeneity criterion. IEEE Trans Knowl Data Eng 17(9):1166–1173
Mannila H (2000) Theoretical frameworks for data mining. SIGKDD Explor Newsl 1:30–32
Ngai E, Xiu L, Chau D (2009) Application of data mining techniques in customer relationship management: a literature review and classification. Expert Syst Appl 36(2):2592–2602
Nisbet R, Elder J, Miner G (2009) Handbook of statistical analysis and data mining applications. Academic Press, London
Polikar R (2006) Ensemble based systems in decision making. IEEE Circuits Syst Mag 6(3):21 –45
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco
Ruta D, Gabrys B (2000) An overview of classifier fusion methods. Comput Inf Syst 7(1):1–10
Song HS, Kim JK, Kim SH (2001) Mining the change of customer behavior in an internet shopping mall. Expert Syst Appl 21(3):157–168
Subramanian A, Pramala S, Rajalakshmi B, Rajaram R (2010) Improving decision tree performance by exception handling. Int J Autom Comput 7(3):372–380
Žliobaite I, Bakker J, Pechenizkiy M (2012) Beating the baseline prediction in food sales: how intelligent an intelligent predictor is? Expert Syst Appl 39(1):806–815
Wang J, Karypis G (2004) Summary: Efficiently summarizing transactions for clustering. In: Proceedings of the fourth IEEE international conference on data mining, IEEE Computer Society, Washington, DC, ICDM ’04, pp 241–248
Webb GI, Pazzani MJ, Billsus D (2001) Machine learning for user modeling. User Model User-Adap Inter 11(1–2):19–29
Widmer G, Kubat M (1996) Learning in the presence of concept drift and hidden contexts. Tech. Rep. 1, Hingham
Witten IH, Frank E (1999) Practical machine learning tools and techniques with java implementations, The Morgan Kaufmann series in data management systems, vol 1. Elsevier, Amsterdam
Yan H, Chen K, Liu L (2006) Efficiently clustering transactional data with weighted coverage density. In: Proceedings of the 15th ACM international conference on Information and knowledge management, ACM, New York, NY, CIKM ’06, pp 367–376
Yan H, Chen K, Liu L, Yi Z (2010) Scale: a scalable framework for efficiently clustering transactional data. Data Min Knowl Discov 20(1):1–27
Yu PS (1999) Data mining and personalization technologies. In: Proceedings of the sixth international conference on database systems for advanced applications, IEEE Computer Society, Washington, pp 6–13
Acknowledgments
This research work was jointly funded by Great Western Research and Screwfix Limited.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Apeh, E., Gabrys, B. Detecting and Visualizing the Change in Classification of Customer Profiles based on Transactional Data. Evolving Systems 4, 27–42 (2013). https://doi.org/10.1007/s12530-012-9065-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12530-012-9065-2