Abstract
Computational Topological Data Analysis (TDA) is a collection of procedures which permits extracting certain robust features of high dimensional data, even when the number of data points is relatively small. Classical statistical data analysis is not very successful at or even cannot handle such situations altogether. Hidden features or structure in high dimensional data expresses some direct and indirect links between data points. Such may be the case when there are no explicit links between persons like clients in a database but there may still be important implicit links which characterize client populations and which also make different such populations more comparable. We explore the potential usefulness of applying TDA to different versions of credit scoring data, where clients are credit takers with a known defaulting behavior.
References
Carlsson, G.: Topology and data. Bull. (New Series) Am. Math. Soc. 46(2), 255–308 (2009)
Fasy, B.T., Kim, J., Lecci, F., Maria, C.: Introduction to the R package TDA (with the CMU TopStat Group). https://cran.r-project.org/web/packages/TDA/vignettes/article.pdf
Lumm, P.Y., Carlsson, G., Vejdemo-Johansson, M.: The topology of politics: voting connectivity in the US House of Representatives. http://www.cs.cmu.edu/~sbalakri/Topology_final_versions/politics.pdf
Maria C.: GUDHI, Simplicial Complexes and Persistent Homology Packages (2014). http://project.inria.fr/gudhi/software/
Morozov, D.: Dionysus, a C++ library for computing persistent homology (2007). http://www.mrzv.org/software/dionysus/
Nicolau, M., Levine, A., Carlsson, G.: Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proc. Nat. Acad. Sci. 108, 7265–7270 (2011)
Perea, J., Harer, J.: Sliding windows and persistence: an application of topological methods to signal analysis (2014). arXiv:1307.6188v1
Schebesch, K.B., Stecking, R.: Clustering for data privacy and classification tasks. In: Huisman, D., Louwerse, I., Wagelmans, A.P.M. (eds.) Selected Papers of the International Conference on Operations Research OR2013, Rotterdam, 3–6 September 2013, Operations Research Proceedings, pp. 397–403. Springer (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this paper
Cite this paper
Schebesch, K.B., Stecking, R.W. (2017). Topological Data Analysis for Extracting Hidden Features of Client Data. In: Dörner, K., Ljubic, I., Pflug, G., Tragler, G. (eds) Operations Research Proceedings 2015. Operations Research Proceedings. Springer, Cham. https://doi.org/10.1007/978-3-319-42902-1_65
Download citation
DOI: https://doi.org/10.1007/978-3-319-42902-1_65
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42901-4
Online ISBN: 978-3-319-42902-1
eBook Packages: Business and ManagementBusiness and Management (R0)