Hashing for Financial Credit Risk Analysis

Ribeiro, Bernardete; Chen, Ning

doi:10.1007/978-3-319-12640-1_48

Hashing for Financial Credit Risk Analysis

Bernardete Ribeiro²⁰ &
Ning Chen²¹

Conference paper

2384 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8835))

Abstract

Hashing techniques have recently become the trend for accessing complex content over large data sets. With the overwhelming financial data produced today, binary embeddings are efficient tools of indexing big datasets for financial credit risk analysis. The rationale is to find a good hash function such that similar data points in Euclidean space preserve their similarities in the Hamming space for fast data retrieval. In this paper, first we use a semi-supervised hashing method to take into account the pairwise supervised information for constructing the weight adjacency graph matrix needed to learn the binarised Laplacian EigenMap. Second, we train a generalised regression neural network (GRNN) to learn the k-bits hash code. Third, the k-bits code for the test data is efficiently found in the recall phase. The results of hashing financial data show the applicability and advantages of the approach to credit risk assessment.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Andoni, A., Indyk, P.: Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Communications of the ACM 51(1), 117–121 (2008)
Article Google Scholar
Baluja, S., Covell, M.: Learning to hash: forgiving hash functions and applications. Data Mining and Knowledge Discovery 17, 402–430 (2008)
Article MathSciNet Google Scholar
Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation 15, 1373–1396 (2002)
Article Google Scholar
Bodo, Z., Csato, L.: Linear spectral hashing. Neurocomputing 141, 117–123 (2014)
Article Google Scholar
Cai, D., He, X., Han, J., Huang, T.S.: Graph regularized non-negative matrix factorization for data representation. IEEE Trans. on Pattern Analysis and Machine Intelligence 33(8), 1548–1560 (2011)
Article Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Trans. on Intelligent Systems and Technology 2, 27:1–27:27 (2011), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Chung, F.: Spectral Graph Theory. American Mathematical Society (1997)
Google Scholar
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 391–407 (1990)
Article Google Scholar
Gordo, A., Perronnin, F., Gong, Y., Lazebnik, S.: Asymmetric distances for binary embeddings. IEEE Trans. on Pattern Analysis and Machine Intelligence 36(1), 33–47 (2014)
Article Google Scholar
Indyk, P., Motwani, R.: Approximate nearest neighbors: towards removing the curse of dimensionality. In: 30th STOC, pp. 604–613. ACM Press (1998)
Google Scholar
Nene, S.A., Nayar, S.K.: A simple algorithm for nearest neighbor search in high dimensions. Tech. Rep. CUCS-030-95, CS Dep, University of Columbia, USA (1995)
Google Scholar
Raginsky, M., Lazebnik, S.: Locality sensitive binary codes from shift-invariant kernels. In: Adv. in Neural Information Proc. Sys. (NIPS), pp. 1509–1517 (2009)
Google Scholar
Ribeiro, B., Chen, N.: Graph weighted subspace learning models in bankruptcy. In: Proc. of Int. J. Conf. on Neural Networks (IJCNN), pp. 2055–2061. IEEE (2011)
Google Scholar
Salakhutdinov, R., Hinton, G.: Semantic hashing. Int. J. Approx. Reasoning 50(7), 969–978 (2009)
Article Google Scholar
Specht, D.F.: A general regression neural network. IEEE Transactions on Neural Networks 2(6), 568–576 (1991)
Article Google Scholar
Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Adv. in Neural Information Proc. Sys. 21 (NIPS), pp. 1753–1760 (2009)
Google Scholar
Zhang, D., Wang, J., Cai, D., Lu, J.: Self-taught hashing for fast similarity search. In: Proc. of the 33rd Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, pp. 18–25. ACM (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

CISUC - Department of Informatics Engineering, University of Coimbra, Portugal
Bernardete Ribeiro
GECAD, Instituto Superior de Engenharia do Porto, Portugal
Ning Chen

Authors

Bernardete Ribeiro
View author publications
You can also search for this author in PubMed Google Scholar
Ning Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Artificial Intelligence, Faculty of Computer Science and Information Technology Building, University of Malaya, 50603, Kuala Lumpur, Malaysia
Chu Kiong Loo
Department of Electronics and Communication Engineering, College of Engineering, Universiti Tenaga Nasional, Jalan IKRAM-UNITEN, 43009, Kajang, Selangor, Malaysia
Keem Siah Yap
School of Engineering and Information Technology, Murdoch University, South St., 6150, Murdoch, Western Australia, Australia
Kok Wai Wong
Department of Electrical and Electronics Engineering, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, 120-749, Seoul, South Korea
Andrew Teoh
Department of Electrical and Electronic Engineering, Xi’an Jiaotong-Liverpool University, Ren’ai Road 111, SIP 215123, Suzhou, Jiangsu Province, China
Kaizhu Huang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ribeiro, B., Chen, N. (2014). Hashing for Financial Credit Risk Analysis. In: Loo, C.K., Yap, K.S., Wong, K.W., Teoh, A., Huang, K. (eds) Neural Information Processing. ICONIP 2014. Lecture Notes in Computer Science, vol 8835. Springer, Cham. https://doi.org/10.1007/978-3-319-12640-1_48

Download citation

DOI: https://doi.org/10.1007/978-3-319-12640-1_48
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12639-5
Online ISBN: 978-3-319-12640-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics