Abstract
Person names often need to be represented in a consistent format in an application, for example, in <Last Name, Given Name, Suffix> format in library catalogs. Obtaining a normalized representation automatically from an input name requires precise labeling of its components. The process is difficult owing to numerous cultural conventions in writing personal names. In this paper, we propose deep learning-based techniques to achieve this using sequence-to-sequence learning. We design several architectures using a bidirectional long short-term memory (BiLSTM)-based recurrent neural network (RNN). We compare these methods with one based on the hidden Markov model. We perform experiments on a large collection of author names drawn from the National Digital Library of India. The best accuracy of \(94\%\) is achieved by the character-level BiLSTM with a conditional random field at the output layer. We also show visualizations of the vectors (representing person names) learned by a BiLSTM and how these vectors are clustered according to name structures. Our study shows that deep learning is a promising approach to automatic name segmentation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Borkar, V., Deshmukh, K., Sarawagi, S.: Automatic segmentation of text into structured records. In: ACM SIGMOD Record, vol. 30, pp. 175–186. ACM (2001)
Churches, T., Christen, P., Lim, K., Zhu, J.X.: Preparation of name and address data for record linkage using hidden Markov models. BMC Med. Inform. Decis. Mak. 2(1), 9 (2002)
Das, G.S., Li, X., Sun, A., Kardes, H., Wang, X.: Person-name parsing for linking user web profiles. In: Proceedings of the 18th International Workshop on Web and Databases, pp. 20–26. ACM (2015)
Deng, L.: A tutorial survey of architectures, algorithms, and applications for deep learning. APSIPA Trans. Signal Inf. Process. 3, e2 (2014)
Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the SIGKDD Conference on Knowledge Discovery and Data Mining 1996, pp. 226–231 (1996)
Forney, G.D.: The Viterbi algorithm. Proc. IEEE 61(3), 268–278 (1973)
Gonçalves, R.D.C.B., Freire, S.M.: Name segmentation using hidden Markov models and its application in record linkage. Cadernos de Saude Publica 30(10), 2039–2048 (2014)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Johnson, S.B., Bales, M.E., Dine, D., Bakken, S., Albert, P.J., Weng, C.: Automatic generation of investigator bibliographies for institutional research networking systems. J. Biomed. Inform. 51, 8–14 (2014)
Keras-Team: Keras documentation (2018). https://keras.io/. Accessed 09 Mar 2019
Lipton, Z.C., Berkowitz, J., Elkan, C.: A critical review of recurrent neural networks for sequence learning. arXiv preprint arXiv:1506.00019 (2015)
van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(Nov), 2579–2605 (2008)
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–286 (1989)
Sarawagi, S.: Information extraction. Found. Trends Databases 1(3), 261–377 (2008)
Sutton, C., McCallum, A.: An introduction to conditional random fields. Found. Trends® Mach. Learn. 4(4), 267–373 (2012)
Yadav, V., Bethard, S.: A survey on recent advances in named entity recognition from deep learning models. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 2145–2158 (2018)
Zeyer, A., Doetsch, P., Voigtlaender, P., Schlüter, R., Ney, H.: A comprehensive study of deep bidirectional LSTM RNNs for acoustic modeling in speech recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2462–2466. IEEE (2017)
Acknowledgements
This work is supported by the National Digital Library of India Project sponsored by the Ministry of Human Resource Development, Government of India at IIT Kharagpur.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Santosh, T.Y.S.S., Sanyal, D.K., Das, P.P. (2020). Person Name Segmentation with Deep Neural Networks. In: B. R., P., Thenkanidiyoor, V., Prasath, R., Vanga, O. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2019. Lecture Notes in Computer Science(), vol 11987. Springer, Cham. https://doi.org/10.1007/978-3-030-66187-8_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-66187-8_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-66186-1
Online ISBN: 978-3-030-66187-8
eBook Packages: Computer ScienceComputer Science (R0)