Abstract
Retaining users and customers is one of the most important challenges for the service industry, from mobile communications to online gaming. As the users of these services form dynamic networks that grow in size, predicting ‘churners’ becomes increasingly difficult. In this work, we explore the use of anomaly detection for churn prediction. To this end, we evaluate bio-inspired and deterministic online clustering algorithms on both cell phone and online gaming data sets. We discuss the results of each technique from three perspectives: feature identification, sensitivity to parameter settings, and capacity to detect churn.
Acknowledgement
This research is supported by the Mitacs Accelerate Internship grant and is conducted as part of the Dalhousie NIMS Lab: https://projects.cs.dal.ca/projectx.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Tatar, S.B., McIntyre, A., Zincir-Heywood, N., Heywood, M. (2015). Benchmarking Stream Clustering for Churn Detection in Dynamic Networks. In: Japkowicz, N., Matwin, S. (eds) Discovery Science. DS 2015. Lecture Notes in Computer Science, vol 9356. Springer, Cham. https://doi.org/10.1007/978-3-319-24282-8_24
DOI: https://doi.org/10.1007/978-3-319-24282-8_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24281-1
Online ISBN: 978-3-319-24282-8
eBook Packages: Computer Science, Computer Science (R0)