Citation count prediction as a link prediction problem

Pobiedina, Nataliia; Ichise, Ryutaro

doi:10.1007/s10489-015-0657-y

Citation count prediction as a link prediction problem

Published: 03 April 2015

Volume 44, pages 252–268, (2016)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Nataliia Pobiedina¹ &
Ryutaro Ichise²

2762 Accesses
26 Citations
Explore all metrics

Abstract

The citation count is an important factor to estimate the relevance and significance of academic publications. However, it is not possible to use this measure for papers which are too new. A solution to this problem is to estimate the future citation counts. There are existing works, which point out that graph mining techniques lead to the best results. We aim at improving the prediction of future citation counts by introducing a new feature. This feature is based on frequent graph pattern mining in the so-called citation network constructed on the basis of a dataset of scientific publications. Our new feature improves the accuracy of citation count prediction, and outperforms the state-of-the-art features in many cases which we show with experiments on two real datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

http://www-kdd.isti.cnr.it/GERM/

References

Pobiedina N, Ichise R (2014) Predicting citation counts for academic literature using graph pattern mining. In: Proceeding IEA/AIE, pp 109–119
Garfield E (2001) Impact factors, and why they won’t go away. Science 411(6837):522
Google Scholar
Hirsch J (2005) An index to quantify an individual’s scientific research output. Proc the National Academy of Sciences of the United States America 102(46):16569
Article Google Scholar
Beel J, Gipp B (2009) Google scholar’s ranking algorithm: The impact of citation counts (an empirical study). In: Proceeding RCIS, pp 439–446
Bethard S, Jurafsky D (2010) Who should I cite: learning literature search models from citation behavior. In: Proceeding CIKM, pp 609–618
Callaham M, Wears R, Weber E (2002) Journal prestige, publication bias, and other characteristics associated with citation of published studies in peer-reviewed journals. J. Am. Med. Assoc. 287(21):2847–50
Article Google Scholar
Kulkarni AV, Busse JW, Shams I (2007) Characteristics associated with citation rate of the medical literature. PLOS One 2(5)
Didegah F, Thelwall M (2013) Determinants of research citation impact in nanoscience and nanotechnology. JASIST (JASIS) 64(5):1055–1064
Article Google Scholar
Livne A, Adar E, Teevan J, Dumais S (2013) Predicting citation counts using text and graph mining. In: Proceeding the iConference 2013 Workshop on Computational Scientometrics: Theory and Applications
Bringmann B, Berlingerio M, Bonchi F, Gionis A (2010) Learning and predicting the evolution of social networks. IEEE Intell Syst 25:26–35
Article Google Scholar
Yan R, Tang J, Liu X, Shan D, Li X (2011) Citation count prediction: learning to estimate future citations for literature. In: Proceeding CIKM, pp 1247–1252
Mcgovern A, Friedl L, Hay M, Gallagher B, Fast A, Neville J, Jensen D (2003) Exploiting relational structure to understand publication patterns in high-energy physics. SIGKDD Explorations 5:2003
Article Google Scholar
Yan R, Huang C, Tang J, Zhang Y, Li X (2012) To better stand on the shoulder of giants. In: Proceeding JCDL, pp 51– 60
Barabasi AL, Albert R (1999) Emergence of scaling in random networks. Sci Mag 286(5439):509–512
MathSciNet Google Scholar
Adamic LA, Adar E (2003) Friends and neighbors on the web. Soc Networks 25(3):211–230
Article Google Scholar
Liben-Nowell D (2007) The link-prediction problem for social networks. JASIST 58(7):1019–1031
Article Google Scholar
Munasinghe L, Ichise R (2012) Time score: A new feature for link prediction in social networks. IEICE Trans 95-D(3):821–828
Google Scholar
Shi X, Leskovec J, McFarland D A (2010) Citing for high impact. In: Proceeding JCDL, pp 49–58
Devroye L, Gyrfi L, Lugosi G (1996) A Probabilistic Theory of Pattern Recognition. Springer
Chang CC, Lin CJ (2011) Libsvm: A library for support vector machines. ACM Trans Intell Syst Technol 2(3):1–27
Article Google Scholar
Hothorn T, Hornik K, Zeileis A (2006) Unbiased recursive partitioning: A conditional inference framework. J Comp Graph Stat 15(3):651–674
Article MathSciNet Google Scholar
Breiman L, Friedman J, Stone C J, Olshen R (1984) Classification and Regression Trees. Chapman and Hall/CRC
The R project for statistical computing http://www.r-project.org/ (January 2013)
Dietterich TG (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput 10(7):1895–1923
Article Google Scholar
Sokolova M, Lapalme G (2009) A systematic analysis of performance measures for classification tasks. Inf Process Manag 45(4):427–437
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria
Nataliia Pobiedina
Principles of Informatics Research Division, National Institute of Informatics, Tokyo, Japan
Ryutaro Ichise

Authors

Nataliia Pobiedina
View author publications
You can also search for this author in PubMed Google Scholar
Ryutaro Ichise
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nataliia Pobiedina.

Additional information

This is an extended and enhanced version of the results published in [1].

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pobiedina, N., Ichise, R. Citation count prediction as a link prediction problem. Appl Intell 44, 252–268 (2016). https://doi.org/10.1007/s10489-015-0657-y

Download citation

Published: 03 April 2015
Issue Date: March 2016
DOI: https://doi.org/10.1007/s10489-015-0657-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Citation count prediction as a link prediction problem

Abstract

Access this article

Similar content being viewed by others

Artificial intelligence and machine learning for disaster prediction: a scientometric analysis of highly cited papers

Citation-based clustering of publications using CitNetExplorer and VOSviewer

Visualizing Bibliometric Networks

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Citation count prediction as a link prediction problem

Abstract

Access this article

Similar content being viewed by others

Artificial intelligence and machine learning for disaster prediction: a scientometric analysis of highly cited papers

Citation-based clustering of publications using CitNetExplorer and VOSviewer

Visualizing Bibliometric Networks

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation