skip to main content
10.1145/3523181.3523194acmotherconferencesArticle/Chapter ViewAbstractPublication PagesasseConference Proceedingsconference-collections
research-article

Cluster Analysis for Customer Segmentation with Open Banking Data

Published: 18 April 2022 Publication History

Abstract

Segmenting customers into different groups using their characteristics and behaviors has always been an important topic. Customer segmentation can lead to better customer understanding and targeting, which in turn leads to more effective product tailoring and marketing strategies. Data mining methods are powerful techniques that can be used in customer segmentation to find customers with similar characteristics. Past research that evaluated different data mining techniques has often had drawbacks, such as using too time-consuming methods or conducting studies with smaller data sets. Density-based clustering algorithms for customer segmentation, such as the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) has only been examined by a few research papers. This study, which summarized the main findings of the unpublished dissertation of Bartels [2021], aimed to classify the segmentation of customers using a Recency, Frequency and Monetary Value (RFM) Model and the clustering techniques, K-Means and DBSCAN, to find groups of similarities and differences and to discover potential valuable and vulnerable customers. The data used was from Open Banking data sets, including anonymized transactions from various bank customers in the UK for three months in 2017 and mainly focused on different types of expenses. K-Means found three clusters each month that represent the most, medium, and least valuable customers. While the most valuable customers have the highest average values per attribute, the least valuable customers have the lowest average values. The found clusters were analyzed and evaluated to find potential vulnerable and valuable groups, which can help with future product tailoring and marketing, especially for unforeseen emergency circumstances such as a pandemic. K-Means outperformed DBSCAN, as the latter showed negative silhouette coefficients.

References

[1]
(n.d.). The Global Open Finance Centre of Excellence. Retrieved 24.08. 2021 from https://ddi.ac.uk/case-studies/the-global-open-finance-centre-of-excellence/
[2]
(n.d.). 2.3. Clustering. Retrieved 11.08. 2021 from https://scikit-learn.org/stable/modules/clustering.html#
[3]
Abel, William, Key, Tomas and Shaw, Catherine. 2020. How persistent will the impact of Covid-19 on unemployment be? Retrieved 03.08. 2021 from https://www.bankofengland.co.uk/bank-overground/2020/how-persistent-will-the–impact-of-covid-19-on-unemployment-be
[4]
Bartels, Catja. 2021. Cluster Analysis for Customer Segmentation with Open Banking Data. Master's thesis, Business School, University of Edinburgh.
[5]
Cheng, Ching-Hsue and Chen, You-Shyang. 2009. Classifying the segmentation of customer value via RFM model and RS theory. Expert Systems with Applications, 36, 3, Part 1 (2009/04/01/ 2009), 4176-4184. https://doi.org/10.1016/j.eswa.2008.04.003
[6]
Dhandayudam, Prabha and Krishnamurthi, Ilango. 2014. A Rough Set Approach for Customer Segmentation. Data Science Journal, 13 (2014), 1-11. https://doi.org/10.2481/dsj.13-019
[7]
Economist, The. 2021. Will the economic recovery survive the end of emergency stimulus? Retrieved 03.08. 2021 from https://www.economist.com/finance-and-economics/2021/07/18/will-the-economic-recovery-survive-the-end-of-emergency-stimulus
[8]
Edmonds, Timothy. 2018. Open Banking: banking but not as we know it? Retrieved from https://commonslibrary.parliament.uk/research-briefings/cbp-8215/
[9]
Ester, Martin, Kriegel, Hans Peter, Schubert, Erich, Sander, Jörg and Xu, Xiaowei 2017. DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN. ACM Trans. Database Syst., 42, 3 (2017), 1–21. https://doi.org/10.1145/3068335
[10]
Fader, Peter S., Hardie, Bruce G.S. and Lee, Ka Lok. 2005. RFM and CLV: Using Iso-Value Curves for Customer Base Analysis. Journal of Marketing Research, 42, 4 (2005), 415-430. https://doi.org/10.1509/jmkr.2005.42.4.415
[11]
Farris, P.W., Bendle, N., Pfeifer, P.E. and Reibstein, D. 2010. Marketing Metrics: The Definitive Guide to Measuring Marketing Performance. Pearson Education.
[12]
Forgy, E. 1965. Cluster analysis of multivariate data : efficiency versus interpretability of classifications. Biometrics, 21 (1965), 768-769
[13]
Han, J., Pei, J. and Kamber, M. 2011. Data Mining: Concepts and Techniques. Elsevier Science.
[14]
Hossain, A. S. M. S. 2017. Customer segmentation using centroid based and density based clustering algorithms. Khulna, Bangladesh. https://doi.org/10.1109/EICT.2017.8275249
[15]
Hughes, Arthur Middleton. 1994. Strategic database marketing : the masterplan for starting and managing a profitable, customer-based marketing program. Probus Pub. Co.
[16]
Khalili-Damghani, Kaveh, Abdi, Farshid and Abolmakarem, Shaghayegh. 2018. Hybrid soft computing approach based on clustering, rule mining, and decision tree analysis for customer segmentation problem: Real case of customer-centric industries. Applied Soft Computing, 73 (2018/12/01/ 2018), 816-828. https://doi.org/10.1016/j.asoc.2018.09.001
[17]
Liao, Shu-hsien, Chen, Yin-ju and Lin, Yi-tsun. 2011. Mining customer knowledge to implement online shopping and home delivery for hypermarkets. Expert Systems with Applications, 38, 4 (2011/04/01/ 2011), 3982-3991. https://doi.org/10.1016/j.eswa.2010.09.059
[18]
Ling, Raymond and Yen, David C. 2001. Customer Relationship Management: An Analysis Framework and Implementation Strategies. Journal of Computer Information Systems, 41, 3 (2001/03/01 2001), 82-97. https://doi.org/10.1080/08874417.2001.11647013
[19]
Liu, Duen-Ren and Shih, Ya-Yueh. 2005. Integrating AHP and data mining for product recommendation based on customer lifetime value. Information & Management, 42, 3 (2005/03/01/ 2005), 387-400. https://doi.org/10.1016/j.im.2004.01.008
[20]
MacQueen, James. 1967. Some methods for classification and analysis of multivariate observations. Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, Oakland, CA, USA.
[21]
Maimon, O. and Rokach, L. 2010. Data Mining and Knowledge Discovery Handbook. Springer US.
[22]
Newell, Frederick. 1997. The New Rules of Marketing: How to Use One-To-One Relationship Marketing to Be the Leader in Your Industry. McGraw-Hill.
[23]
Open Banking Limited. 2021. ABOUT THE OBIE. Retrieved 24.08. 2021 from https://www.openbanking.org.uk/about-us/
[24]
Schijns, Jos M. C. and Schröder, Gaby J. 1996. Segment selection by relationship strength. Journal of Direct Marketing, 10, 3 (1996), 69-79. https://doi.org/10.1002/(SICI)1522-7138(199622)10:369::AID-DIR63.0.CO;2-W
[25]
Smith, Wendell R. 1956. Product Differentiation and Market Segmentation as Alternative Marketing Strategies. Journal of Marketing, 21, 1 (1956), 3-8. 10.1177/002224295602100102
[26]
Stone, B. 1984. Successful Direct Marketing Methods: The Bob Stone Direct Marketing Book. Crain Books.
[27]
Sun, Zhao-Hui, Zuo, Tian-Yu, Liang, Di, Ming, Xinguo, Chen, Zhihua and Qiu, Siqi. 2021. GPHC: A heuristic clustering method to customer segmentation. Applied Soft Computing, 111 (2021/11/01/ 2021), 107677. https://doi.org/10.1016/j.asoc.2021.107677
[28]
Wang, X., Zhou, C., Yang, Y., Yang, Y., Ji, T., Wang, J., Chen, J. and Zheng, Y. . 2020. Electricity Market Customer Segmentation Based on DBSCAN and k-Means: A Case on Yunnan Electricity Market. Conference Location, https://doi.org/10.1109/AEEES48850.2020.9121413
[29]
Wei-jiang, Liu, Shu-Yong, Duan, Xue, Yang and Xiaofeng, Wang. 2011. Determination of customer value measurement model RFM index weights. African Journal of Business Management, 5 (2011), 5567-5572. 10.5897/AJBM11.290
[30]
Weinstein, Art. 2004. Handbook of Market Segmentation : Strategic Targeting for Business and Technology Firms, Third Edition. Taylor & Francis Group.
[31]
Witten, I.H. and Frank, E. 2005. Data Mining: Practical Machine Learning Tools and Techniques, Second Edition. Elsevier Science.
[32]
Wu, Jing and Lin, Zheng. 2005. Research on customer segmentation model by clustering. In Proceedings of the Proceedings of the 7th international conference on Electronic commerce. Association for Computing Machinery, Xi'an, China, 316–318. https://doi.org/10.1145/1089551.1089610

Cited By

View all
  • (2025)Analyzing Bank Customer Behavior: Segmentation and Prediction Using Big Data AnalyticsComputing Technologies for Sustainable Development10.1007/978-3-031-82383-1_18(223-250)Online publication date: 15-Feb-2025
  • (2024)Optimal data-driven strategy for in-house and outsourced technological innovations by open banking APIsFuture Business Journal10.1186/s43093-024-00397-310:1Online publication date: 19-Nov-2024
  • (2024)Advancing Customer Segmentation in Banking: Harnessing Machine Learning and H2O for Personalized InsightsProceedings of the 10th International Conference on Advanced Intelligent Systems and Informatics 202410.1007/978-3-031-71619-5_21(246-256)Online publication date: 13-Oct-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ASSE' 22: 2022 3rd Asia Service Sciences and Software Engineering Conference
February 2022
202 pages
ISBN:9781450387453
DOI:10.1145/3523181
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 April 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Cluster Analysis
  2. DBSCAN
  3. K-Means
  4. Open Banking
  5. RFM Model

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ASSE' 22

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)101
  • Downloads (Last 6 weeks)7
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Analyzing Bank Customer Behavior: Segmentation and Prediction Using Big Data AnalyticsComputing Technologies for Sustainable Development10.1007/978-3-031-82383-1_18(223-250)Online publication date: 15-Feb-2025
  • (2024)Optimal data-driven strategy for in-house and outsourced technological innovations by open banking APIsFuture Business Journal10.1186/s43093-024-00397-310:1Online publication date: 19-Nov-2024
  • (2024)Advancing Customer Segmentation in Banking: Harnessing Machine Learning and H2O for Personalized InsightsProceedings of the 10th International Conference on Advanced Intelligent Systems and Informatics 202410.1007/978-3-031-71619-5_21(246-256)Online publication date: 13-Oct-2024
  • (2023)Optimizing Big Data Implementation to Create Business Value and Architecture Proposed in the Banking Industry: A Systematic Review2023 International Conference on Computer Science, Information Technology and Engineering (ICCoSITE)10.1109/ICCoSITE57641.2023.10127701(667-672)Online publication date: 16-Feb-2023
  • (2023)Experimental Analysis on Banking Customer Segmentation using Machine Learning Techniques2023 Global Conference on Information Technologies and Communications (GCITC)10.1109/GCITC60406.2023.10426116(1-6)Online publication date: 1-Dec-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media