Skip to main content

Abstract

In this paper we describe how the co-author network, which is built from the bibliographic records, can be incorporated into the process of personal name language classification. The model is tested on the DBLP data set. The results show that the extension of the language classification process with the co-author network may help to refine the name language classification obtained from the author names considered independently. It may also lead to the discovery of dependencies between the elements of the co-author network, or participation of authors in scientific communities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Börner, K., Dall’Asta, L., Ke, W., Vespignani, A.: Studying the emerging global brain: Analyzing and visualizing the impact of co-authorship teams. Complexity 10(4) (2005)

    Google Scholar 

  2. Google: Google Scholar, http://scholar.google.com/

  3. Han, H., Zha, H., Lee, G.C.: Name disambiguation in author citations using a K-way spectral clustering method. In: ACM/IEEE Joint Conference on Digital Libraries, pp. 334–343 (2005)

    Google Scholar 

  4. Hiemstra, D., Hauff, C., Jong, F., Kraaij, W.: SIGIR’s 30th anniversary: an analysis of trends in IR research and the topology of its community. In: SIGIR Forum, vol. 41(2) (2007)

    Google Scholar 

  5. Huang, T., Huang, M.L.: Analysis and Visualization of Co-authorship Networks for Understanding Academic Collaboration and Knowledge Domain of Individual Researchers. In: 4th International Conference on Computer Graphics, Imaging and Visualization, pp. 18–23 (2006)

    Google Scholar 

  6. Ke, W., Börner, K., Viswanath, L.: Major Information Visualization Authors, Papers and Topics in the ACM Library. In: 10th IEEE Symposium on Information Visualization (2004)

    Google Scholar 

  7. Klink, S., Reuther, P., Weber, A., Walter, B., Ley, M.: Analysing Social Networks Within Bibliographical Data. In: Database and Expert Systems Applications, pp. 234–243 (2006)

    Google Scholar 

  8. Lee, C.G., Bollacker, K.D., Lawrence, S.: CiteSeer: An Automatic Citation Indexing System. In: ACM Digital Libraries, pp. 89–98 (1998), http://citeseer.ist.psu.edu/

  9. Lee, D., On, B., Kang, J., Park, S.: Effective and scalable solutions for mixed and split citation problems in digital libraries. In: Workshop on Information Quality in Information Systems, pp. 69–76 (2005)

    Google Scholar 

  10. Ley, M.: DBLP, Computer Science Bibliography, http://www.informatik.uni-trier.de/~ley/db/

  11. Mei, Q., Cai, D., Zhang, D., Zhai, C.: Topic modeling with network regularization. In: 17th International World Wide Web Conferences, pp. 101–110 (2008)

    Google Scholar 

  12. Murray, C., Ke, W., Börner, K.: Mapping Scientific Disciplines and Author Expertise Based on Personal Bibliography Files. In: 10th International Conference on Information Visualisation, vol. IV, pp. 258–263 (2006)

    Google Scholar 

  13. Reuther, P., Walter, B., Ley, M., Weber, A., Klink, S.: Managing the Quality of Person Names in DBLP. In: European Conference on Digital Libraries, pp. 508–511 (2006)

    Google Scholar 

  14. Rodrigues, J.F., Tong, H., Traina, A., Faloutsos, C., Leskovec, J.: GMine: A System for Scalable, Interactive Graph Visualization and Mining. In: 32nd Very Large Data Bases, pp. 1195–1198 (2006)

    Google Scholar 

  15. Shannon, C.E.: The Mathematical Theory of Communication. Bell System Technical Journal 27 (1948)

    Google Scholar 

  16. SIGIR – Special Interest Group on Information Retrieval, http://www.sigir.org/

  17. Smeaton, A.F., Keogh, G., Gurrin, C., McDonald, K., Sødring, T.: Analysis of papers from twenty-five years of SIGIR conferences: what have we been doing for the last quarter of a century? SIGIR Forum 37(1) (2003)

    Google Scholar 

  18. Steyvers, M., Smyth, P., Rosen-Zvi, M., Griffiths, T.L.: Probabilistic author-topic models for information discovery. In: Knowledge Discovery and Data Mining, pp. 306–315 (2004)

    Google Scholar 

  19. Wikipedia, the Free Encyclopedia, Lists of People, http://en.wikipedia.org/wiki/

  20. Zhang, H., Qiu, B., Lee, C.G., Foley, H.C., Yen, J.: An LDA-based Community Structure Discovery Approach for LLarge-Scale Social Networks. In: IEEE International Conference on Intelligence and Security Informatics, pp. 200–207. IEEE Press, New York (2007)

    Google Scholar 

  21. Zhang, H., Qiu, B., Lee, C.G., Foley, H.C., Yen, J.: Probabilistic Community Discovery Using Hierarchical Latent Gaussian Mixture Model. In: Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, pp. 663–668 (2007)

    Google Scholar 

  22. Detection of Personal Name Language Origin and Applications (submitted, 2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Biryukov, M. (2008). Co-author Network Analysis in DBLP: Classifying Personal Names. In: Le Thi, H.A., Bouvry, P., Pham Dinh, T. (eds) Modelling, Computation and Optimization in Information Systems and Management Sciences. MCO 2008. Communications in Computer and Information Science, vol 14. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87477-5_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-87477-5_43

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-87476-8

  • Online ISBN: 978-3-540-87477-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics