Skip to main content

Topological Data Analysis for Extracting Hidden Features of Client Data

  • Conference paper
  • First Online:
Book cover Operations Research Proceedings 2015

Part of the book series: Operations Research Proceedings ((ORP))

Abstract

Computational Topological Data Analysis (TDA) is a collection of procedures which permits extracting certain robust features of high dimensional data, even when the number of data points is relatively small. Classical statistical data analysis is not very successful at or even cannot handle such situations altogether. Hidden features or structure in high dimensional data expresses some direct and indirect links between data points. Such may be the case when there are no explicit links between persons like clients in a database but there may still be important implicit links which characterize client populations and which also make different such populations more comparable. We explore the potential usefulness of applying TDA to different versions of credit scoring data, where clients are credit takers with a known defaulting behavior.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  1. Carlsson, G.: Topology and data. Bull. (New Series) Am. Math. Soc. 46(2), 255–308 (2009)

    Article  Google Scholar 

  2. Fasy, B.T., Kim, J., Lecci, F., Maria, C.: Introduction to the R package TDA (with the CMU TopStat Group). https://cran.r-project.org/web/packages/TDA/vignettes/article.pdf

  3. Lumm, P.Y., Carlsson, G., Vejdemo-Johansson, M.: The topology of politics: voting connectivity in the US House of Representatives. http://www.cs.cmu.edu/~sbalakri/Topology_final_versions/politics.pdf

  4. Maria C.: GUDHI, Simplicial Complexes and Persistent Homology Packages (2014). http://project.inria.fr/gudhi/software/

  5. Morozov, D.: Dionysus, a C++ library for computing persistent homology (2007). http://www.mrzv.org/software/dionysus/

  6. Nicolau, M., Levine, A., Carlsson, G.: Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proc. Nat. Acad. Sci. 108, 7265–7270 (2011)

    Article  Google Scholar 

  7. Perea, J., Harer, J.: Sliding windows and persistence: an application of topological methods to signal analysis (2014). arXiv:1307.6188v1

  8. Schebesch, K.B., Stecking, R.: Clustering for data privacy and classification tasks. In: Huisman, D., Louwerse, I., Wagelmans, A.P.M. (eds.) Selected Papers of the International Conference on Operations Research OR2013, Rotterdam, 3–6 September 2013, Operations Research Proceedings, pp. 397–403. Springer (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Klaus B. Schebesch .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this paper

Cite this paper

Schebesch, K.B., Stecking, R.W. (2017). Topological Data Analysis for Extracting Hidden Features of Client Data. In: Dörner, K., Ljubic, I., Pflug, G., Tragler, G. (eds) Operations Research Proceedings 2015. Operations Research Proceedings. Springer, Cham. https://doi.org/10.1007/978-3-319-42902-1_65

Download citation

Publish with us

Policies and ethics