Abstract
Name disambiguation is a very critical problem in scientific cooperation network. Ambiguous author names may occur due to the existence of multiple authors with the same name. Despite much research work has been conducted, the problem is still not resolved and becomes even more serious. In this paper, we focus ourselves on such problem. A method of exploiting user feedback for name disambiguation in scientific cooperation network is proposed, which can make use of user feedback to enhance the performance. Furthermore, to make the user feedback more effective, we divide user feedback into three types and assign different weights to them. To evaluate the effectiveness of our proposed method, experiments are conducted with standard public collections. We compare the performance of our proposal with baseline methods. Results show that the proposed algorithm outperforms the previous methods without introducing user interactions. Besides, we investigate into how different types of user feedback can affect the disambiguation results.
Similar content being viewed by others
References
Bekkerman R, McCallum A (2005) Disambiguating web appearances of people in a social network. In: WWW, pp 463–470
Cao H, Jiang D, Pei J, He Q, Liao Z, Chen E, Li H (2008) Context-aware query suggestion by mining click-through and session data. In: KDD, pp 875–883
Chai X, Vuong BQ, Doan A, Naughton JF (2009) Efficiently incorporating user feedback into information extraction and integration programs. In: SIGMOD conference, pp 87–100
Chen Z, Kalashnikov DV, Mehrotra S (2007) Adaptive graphical approach to entity resolution. In: JCDL, pp 204–213
Cota RG, Ferreira AA, Nascimento C, Gonçalves MA, Laender AHF (2010) An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations. JASIST 61(9): 1853–1870
D’Angelo CA, Giuffrida C, Abramo G (2011) A heuristic approach to author name disambiguation in bibliometrics databases for large-scale research assessments. JASIST 62(2): 257–269
Davis PT, Elson DK, Klavans J (2003) Methods for precise named entity matching in digital collections. In: JCDL, pp 125–127
Ferreira AA, Veloso A, Gonçalves MA, Laender AHF (2010) Effective self-training author name disambiguation in scholarly digital libraries. In: JCDL, pp 39–48
H Yu WK, Hatzivassiloglou V, Wilbur J (2006) A large scale, corpus-based approach for automatically disambiguating biomedical abbreviations. ACM TOIS 24(3): 380C404
Han H, Giles CL, Zha H, Li C, Tsioutsiouliklis K (2004) Two supervised learning approaches for name disambiguation in author citations. In: JCDL, pp 296–305
Han H, Zha H, Giles CL (2005) Name disambiguation in author citations using a k-way spectral clustering method. In: JCDL, pp 334–343
Jin H, Huang L, Yuan P (2009) Name disambiguation using semantic association clustering. In: ICEBE, pp 42–48
Joachims T, Granka LA, Pan B, Hembrooke H, Gay G (2005) Accurately interpreting clickthrough data as implicit feedback. In: SIGIR, pp 154–161
Kang IS, Na SH, Lee S, Jung H, Kim P, Sung WK, Lee JH (2009) On co-authorship for author disambiguation. Inf Process Manag 45(1): 84–97
Lin Q, Wang B, Du Y, Wang X, Li Y (2010) Disambiguating authors by pairwise classification. Tsinghua Sci Technol 15(6): 668–677
Liu Y, Zhang M, Ru L, Ma S (2006) Automatic query type identification based on click through information. In: AIRS, pp 593–600
McRae-Spencer DM, Shadbolt NR (2006) Also by the same author: Aktiveauthor, a citation graph approach to name disambiguation. In: JCDL, pp 53–54
Minkov E, Cohen WW, Ng AY (2006) Contextual search and name disambiguation in email using graphs. In: SIGIR, pp 27–34
Nguyen HT, Cao TH (2010) Exploring wikipedia and text features for named entity disambiguation. In: ACIIDS, vol 2, pp 11–20
Richardson M, Dominowska E, Ragno R (2007) Predicting clicks: estimating the click-through rate for new ads. In: WWW, pp 521–530
Tan YF, Kan MY, Lee D (2006) Search engine driven author disambiguation. In: JCDL, pp 314–315
Treeratpituk P, Giles CL (2009) Disambiguating authors in academic publications using random forests. In: JCDL, pp 39–48
Wang C, Han J, Jia Y, Tang J, Zhang D, Yu Y, Guo J (2010a) Mining advisor-advisee relationships from research publication networks. In: KDD, pp 203–212
Wang F, Tang J, Li J, Wang K (2010) A constraint-based topic modeling approach for name disambiguation. Front Comput Sci China 4(1): 100–111
Whang SE, Menestrina D, Koutrika G, Theobald M, Garcia-Molina H (2009) Entity resolution with iterative blocking. In: SIGMOD, pp 219C232
Wick ML, Rohanimanesh K, Schultz K, McCallum A (2008) A unified approach for schema matching, coreference and canonicalization. In: KDD, pp 722C730
Yang KH, Peng HT, Jiang JY, Lee HM, Ho JM (2008) Author name disambiguation for citations using topic and web correlation. In: ECDL, pp 185–196
Zhang D, Tang J, Li JZ, Wang K (2007) A constraint-based probabilistic framework for name disambiguation. In: CIKM, pp 1019–1022
Zhou Y, Cheng H, Yu JX (2009) Graph clustering based on structural/attribute similarities. PVLDB 2(1): 718–729
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, Y., Wen, A., Lin, Q. et al. Name disambiguation in scientific cooperation network by exploiting user feedback. Artif Intell Rev 41, 563–578 (2014). https://doi.org/10.1007/s10462-012-9323-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-012-9323-5