Abstract
In this paper we describe how the co-author network, built from bibliographic records, can be incorporated into the classification of personal names by language. The model is tested on the DBLP data set. The results show that extending the classification process with the co-author network can refine the name language classification obtained when author names are considered independently. It may also reveal dependencies between elements of the co-author network, or the participation of authors in scientific communities.
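The abstract does not spell out how the co-author network enters the classification, so the following is only a minimal sketch of one plausible reading, not the paper's model. It assumes that each author already has name-only language scores, and that these scores can be blended with the weighted average of the co-authors' scores, where the weight is the number of joint papers. The function names, the `alpha` mixing parameter, the author names, and all numeric scores are hypothetical.

```python
from collections import defaultdict


def build_coauthor_graph(records):
    """Map each author to {co-author: number of joint papers}.

    `records` is a list of author lists, one per bibliographic record.
    """
    graph = defaultdict(lambda: defaultdict(int))
    for authors in records:
        for a in authors:
            for b in authors:
                if a != b:
                    graph[a][b] += 1
    return graph


def refine(name_scores, graph, alpha=0.5):
    """Blend name-only scores with the weighted mean of co-authors' scores.

    Assumption (not from the paper): co-authors tend to share a name
    language, so neighbours' scores are informative. `alpha` controls how
    much weight the neighbourhood receives.
    """
    refined = {}
    for author, own in name_scores.items():
        # Keep only co-authors for which name-only scores are available.
        neighbours = {n: w for n, w in graph.get(author, {}).items()
                      if n in name_scores}
        total = sum(neighbours.values())
        if total == 0:
            # Isolated author: nothing to blend with.
            refined[author] = dict(own)
            continue
        blended = {}
        for lang in own:
            neigh = sum(w * name_scores[n].get(lang, 0.0)
                        for n, w in neighbours.items()) / total
            blended[lang] = (1 - alpha) * own[lang] + alpha * neigh
        refined[author] = blended
    return refined


# Hypothetical records and name-only scores, for illustration only.
records = [
    ["A. Rossi", "G. Bianchi"],
    ["A. Rossi", "G. Bianchi"],
    ["A. Rossi", "M. Lee"],
]
name_scores = {
    "A. Rossi": {"it": 0.9, "en": 0.1},
    "G. Bianchi": {"it": 0.8, "en": 0.2},
    "M. Lee": {"it": 0.5, "en": 0.5},
}

graph = build_coauthor_graph(records)
refined = refine(name_scores, graph, alpha=0.5)
for author, scores in refined.items():
    print(author, scores)
```

In this toy setting the ambiguous name moves toward the language of its only co-author, which is the kind of refinement the abstract refers to. Whether such a simple averaging rule is appropriate for real DBLP data is an open assumption here.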
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Biryukov, M. (2008). Co-author Network Analysis in DBLP: Classifying Personal Names. In: Le Thi, H.A., Bouvry, P., Pham Dinh, T. (eds) Modelling, Computation and Optimization in Information Systems and Management Sciences. MCO 2008. Communications in Computer and Information Science, vol 14. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87477-5_43
DOI: https://doi.org/10.1007/978-3-540-87477-5_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87476-8
Online ISBN: 978-3-540-87477-5
eBook Packages: Computer Science (R0)