Abstract
The Web of Data is based on two simple ideas: employing the RDF data model to publish structured data on the Web, and setting explicit RDF links to interlink data items across different data sources. In this paper, we describe our experience in building a link discovery system between KAKEN, a database that provides the latest information on research projects in Japan, and the DBLP Computer Science Bibliography. Using these links, one can navigate from a computer scientist's record in KAKEN to his or her publications in DBLP. The main problem in linking KAKEN researchers to DBLP authors is name disambiguation. We propose combining an LDA-based topic model with a co-author network approach to improve linkage accuracy.
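As a rough illustration of how topic and co-author evidence might be combined to disambiguate names, the sketch below scores each DBLP candidate for one KAKEN researcher. The abstract does not say how the two signals are merged, so the weighted sum, the `alpha` parameter, the record layout, and all identifiers and names in the sample data are assumptions made for illustration, not the authors' method. Topic vectors are assumed to have been produced beforehand by an LDA model.

```python
import math

def cosine(p, q):
    """Cosine similarity between two topic-proportion vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0

def jaccard(a, b):
    """Jaccard overlap between two sets of co-author names."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def link_score(researcher, author, alpha=0.5):
    """Weighted sum of topic and co-author similarity (alpha is an assumed weight)."""
    topic_sim = cosine(researcher["topics"], author["topics"])
    coauthor_sim = jaccard(researcher["coauthors"], author["coauthors"])
    return alpha * topic_sim + (1 - alpha) * coauthor_sim

def best_match(researcher, candidates, alpha=0.5):
    """Return the candidate with the highest combined score."""
    return max(candidates, key=lambda c: link_score(researcher, c, alpha))

# Hypothetical records: two DBLP authors who share the same name string.
kaken_researcher = {
    "id": "kaken:researcher-1",
    "topics": [0.7, 0.2, 0.1],
    "coauthors": {"A. Sato", "B. Suzuki"},
}
dblp_candidates = [
    {"id": "dblp:candidate-1", "topics": [0.6, 0.3, 0.1],
     "coauthors": {"A. Sato", "C. Ito"}},
    {"id": "dblp:candidate-2", "topics": [0.1, 0.1, 0.8],
     "coauthors": {"D. Kim"}},
]

match = best_match(kaken_researcher, dblp_candidates)
print(match["id"], round(link_score(kaken_researcher, match), 3))
```

In this toy data the first candidate wins on both signals. In practice a threshold on the score would probably be needed so that a researcher with no plausible DBLP counterpart is left unlinked.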
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tran, DH., Takeda, H., Kurakawa, K., Tran, MT. (2012). Combining Topic Model and Co-author Network for KAKEN and DBLP Linking. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Intelligent Information and Database Systems. ACIIDS 2012. Lecture Notes in Computer Science, vol 7198. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28493-9_42
DOI: https://doi.org/10.1007/978-3-642-28493-9_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28492-2
Online ISBN: 978-3-642-28493-9
eBook Packages: Computer Science, Computer Science (R0)