Skip to main content

Hashing for Financial Credit Risk Analysis

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8835))

Abstract

Hashing techniques have recently become the trend for accessing complex content over large data sets. With the overwhelming financial data produced today, binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The rationale is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. In this paper, first we use a semi-supervised hashing method to take into account the pairwise supervised information for constructing the weight adjacency graph matrix needed to learn the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash code. Third, the k-bits code for the test data is efficiently found in the recall phase. The results of hashing financial data show the applicability and advantages of the approach to credit risk assessment.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Andoni, A., Indyk, P.: Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Communications of the ACM 51(1), 117–121 (2008)

    Article  Google Scholar 

  2. Baluja, S., Covell, M.: Learning to hash: forgiving hash functions and applications. Data Mining and Knowledge Discovery 17, 402–430 (2008)

    Article  MathSciNet  Google Scholar 

  3. Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation 15, 1373–1396 (2002)

    Article  Google Scholar 

  4. Bodo, Z., Csato, L.: Linear spectral hashing. Neurocomputing 141, 117–123 (2014)

    Article  Google Scholar 

  5. Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized non-negative matrix factorization for data representation. IEEE Trans. on Pattern Analysis and Machine Intelligence 33(8), 1548–1560 (2011)

    Article  Google Scholar 

  6. Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Trans. on Intelligent Systems and Technology 2, 27:1–27:27 (2011), http://www.csie.ntu.edu.tw/~cjlin/libsvm

  7. Chung, F.: Spectral Graph Theory. American Mathematical Society (1997)

    Google Scholar 

  8. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 391–407 (1990)

    Article  Google Scholar 

  9. Gordo, A., Perronnin, F., Gong, Y., Lazebnik, S.: Asymmetric distances for binary embeddings. IEEE Trans. on Pattern Analysis and Machine Intelligence 36(1), 33–47 (2014)

    Article  Google Scholar 

  10. Indyk, P., Motwani, R.: Approximate nearest neighbors: towards removing the curse of dimensionality. In: 30th STOC, pp. 604–613. ACM Press (1998)

    Google Scholar 

  11. Nene, S.A., Nayar, S.K.: A simple algorithm for nearest neighbor search in high dimensions. Tech. Rep. CUCS-030-95, CS Dep, University of Columbia, USA (1995)

    Google Scholar 

  12. Raginsky, M., Lazebnik, S.: Locality sensitive binary codes from shift-invariant kernels. In: Adv. in Neural Information Proc. Sys. (NIPS), pp. 1509–1517 (2009)

    Google Scholar 

  13. Ribeiro, B., Chen, N.: Graph weighted subspace learning models in bankruptcy. In: Proc. of Int. J. Conf. on Neural Networks (IJCNN), pp. 2055–2061. IEEE (2011)

    Google Scholar 

  14. Salakhutdinov, R., Hinton, G.: Semantic hashing. Int. J. Approx. Reasoning 50(7), 969–978 (2009)

    Article  Google Scholar 

  15. Specht, D.F.: A general regression neural network. IEEE Transactions on Neural Networks 2(6), 568–576 (1991)

    Article  Google Scholar 

  16. Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Adv. in Neural Information Proc. Sys. 21 (NIPS), pp. 1753–1760 (2009)

    Google Scholar 

  17. Zhang, D., Wang, J., Cai, D., Lu, J.: Self-taught hashing for fast similarity search. In: Proc. of the 33rd Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, pp. 18–25. ACM (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Ribeiro, B., Chen, N. (2014). Hashing for Financial Credit Risk Analysis. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds) Neural Information Processing. ICONIP 2014. Lecture Notes in Computer Science, vol 8835. Springer, Cham. https://doi.org/10.1007/978-3-319-12640-1_48

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12640-1_48

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12639-5

  • Online ISBN: 978-3-319-12640-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics