Skip to main content
Log in

A review of scientific impact prediction: tasks, features and methods

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

With the rapid evolution of scientific research, there are a huge volume of papers published every year and the number of scholars is also growing fast. How to effectively predict the scientific impact has become an important research problem, attracting the attention of researchers in various fields, and it is of great significance in improving research efficiency and assisting in decision-making and scientific evaluation. In this paper, we propose a new framework to perform a systematical survey of scientific impact prediction research. Specifically, we take the four common academic entities into account: papers, scholars, venues and institutions. We reviewed all the prediction tasks reported in the literature in detail; the input features are divided into six groups: paper-related, author-related, venue-related, institution-related, network-related and altmetrics-related. Moreover, we classify the forecasting methods into mathematical statistics-based, traditional machine learning-based, deep learning-based and graph-based, and subdivide each category according to the characteristics. Finally, we discuss open issues and existing challenges, and provide potential research directions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Notes

  1. 1 http://www.arnetminer.org/data.

  2. 2 https://journals.aps.org/datasets.

  3. 3 https://kdd.org/kdd-cup/view/kdd-cup-2003/Data.

  4. 4 https://github.com/Lucaweihs/impact-prediction/.

References

  • Abbasi, A., Altmann, J., & Hossain, L. (2011). Identifying the effects of co-authorship networks on the performance of scholars: A correlation and regression analysis of performance measures and social network analysis measures. Journal of Informetrics, 5(4), 594–607.

    Article  Google Scholar 

  • Abramo, G., D’Angelo, C. A., & Felici, G. (2019a). Predicting publication long-term impact through a combination of early citations and journal impact factor. Journal of Informetrics, 13(1), 32–49.

    Article  Google Scholar 

  • Abramo, G., D’Angelo, C. A., & Reale, E. (2019b). Peer review versus bibliometrics: Which method better predicts the scholarly impact of publications? Scientometrics, 121(1), 537–554.

    Article  Google Scholar 

  • Abrishami, A., & Aliakbary, S. (2019). Predicting citation counts based on deep neural network learning techniques. Journal of Informetrics, 13(2), 485–499.

    Article  Google Scholar 

  • Acuna, D. E., Allesina, S., & Kording, K. P. (2012). Predicting scientific success. Nature, 489(7415), 201–202.

    Article  Google Scholar 

  • Akella, A. P., Alhoori, H., Kondamudi, P. R., Freeman, C., & Zhou, H. (2021). Early indicators of scientific impact: Predicting citations with altmetrics. Journal of Informetrics, 15(2), 101128.

    Article  Google Scholar 

  • Ashton, S. V., & Oppenheim, C. (1978). A method of predicting Nobel prizewinners in chemistry. Social Studies of Science, 8(3), 341–348.

    Article  Google Scholar 

  • Ayaz, S., Masood, N., & Islam, M. A. (2018). Predicting scientific impact based on h-index. Scientometrics, 114(3), 993–1010.

    Article  Google Scholar 

  • Bai, X., Liu, H., Zhang, F., Ning, Z., Kong, X., Lee, I., & Xia, F. (2017a). An overview on evaluating and predicting scholarly article impact. Information, 8(3), 73.

    Article  Google Scholar 

  • Bai, X., Zhang, F., Hou, J., Xia, F., Tolba, A., & Elashkar, E. (2017b). Implicit multi-feature learning for dynamic time series prediction of the impact of institutions. IEEE Access, 5, 16372–16382.

    Article  Google Scholar 

  • Bai, X., Pan, H., Hou, J., Guo, T., Lee, I., & Xia, F. (2020). Quantifying success in science: An overview. IEEE Access, 8, 123200–123214.

    Article  Google Scholar 

  • Bai, X., Zhang, F., & Lee, I. (2019). Predicting the citations of scholarly paper. Journal of Informetrics, 13(1), 407–418.

    Article  Google Scholar 

  • Bento, C., Martins, B., & Calado, P. (2013). Predicting the Future Impact of Academic Publications. In L. Correia, L. P. Reis, & J. Cascalho (Eds.), Portuguese Conference on Artificial Intelligence (pp. 366–377). Springer.

    Google Scholar 

  • Bertsimas, D., Brynjolfsson, E., Reichman, S., & Silberhoz, J. (2013). Network analysis for predicting academic impact. Proceedings of the 34th International Conference on Information Systems (ICIS), 92.

  • Bhat, H. S., Huang, L.-H., Rodriguez, S., Dale, R., & Heit, E. (2015). Citation prediction using diverse features. IEEE International Conference on Data Mining Workshop (ICDMW), 2015, 589–596.

    Article  Google Scholar 

  • Bornmann, L., & Daniel, H.-D. (2010). Citation speed as a measure to predict the attention an article receives: An investigation of the validity of editorial decisions at Angewandte Chemie International Edition. Journal of Informetrics, 4(1), 83–88.

    Article  Google Scholar 

  • Bornmann, L., Leydesdorff, L., & Wang, J. (2014). How to improve the prediction based on citation impact percentiles for years shortly after the publication date? Journal of Informetrics, 8(1), 175–180.

    Article  Google Scholar 

  • Brizan, D. G., Gallagher, K., Jahangir, A., & Brown, T. (2016). Predicting citation patterns: Defining and determining influence. Scientometrics, 108(1), 183–200.

    Article  Google Scholar 

  • Brody, T., Harnad, S., & Carr, L. (2006). Earlier web usage statistics as predictors of later citation impact. Journal of the American Society for Information Science and Technology, 57(8), 1060–1072.

    Article  Google Scholar 

  • Bütün, E., & Kaya, M. (2019). Predicting citation count of scientists as a link prediction problem. IEEE Transactions on Cybernetics, 50(10), 4518–4529.

    Article  Google Scholar 

  • Cao, X., Chen, Y., & Liu, K. R. (2016). A data analytic approach to quantifying scientific impact. Journal of Informetrics, 10(2), 471–484.

    Article  Google Scholar 

  • Chakraborty, T., Kumar, S., Goyal, P., Ganguly, N., & Mukherjee, A. (2014). Towards a stratified learning approach to predict future citation counts. IEEE/ACM Joint Conference on Digital Libraries, 351–360.

  • Chawla, D. S. (2021). Frosty reception for algorithm that predicts research papers’ impact. Nature.

  • Cheang, B., Chu, S. K. W., Li, C., & Lim, A. (2014a). A multidimensional approach to evaluating management journals: Refining PageRank via the differentiation of citation types and identifying the roles that management journals play. Journal of the Association for Information Science and Technology, 65(12), 2581–2591.

    Article  Google Scholar 

  • Cheang, B., Chu, S. K. W., Li, C., & Lim, A. (2014b). OR/MS journals evaluation based on a refined PageRank method: An updated and more comprehensive review. Scientometrics, 100(2), 339–361.

    Article  Google Scholar 

  • Cheang, B., Li, C., Lim, A., & Zhang, Z. (2015). Identifying patterns and structural influences in the scientific communication of business knowledge. Scientometrics, 103(1), 159–189.

    Article  Google Scholar 

  • Chen, C. (2012). Predictive effects of structural variation on citation counts. Journal of the American Society for Information Science and Technology, 63(3), 431–449.

    Article  Google Scholar 

  • Cui, P., Shen, Z., Li, S., Yao, L., Li, Y., Chu, Z., & Gao, J. (2020). Causal inference meets machine learning. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 3527–3528.

  • Cummings, D., & Nassar, M. (2020). Structured citation trend prediction using graph neural networks. ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3897–3901.

  • Danell, R. (2011). Can the quality of scientific work be predicted using information on the author’s track record? Journal of the American Society for Information Science and Technology, 62(1), 50–60.

    Article  Google Scholar 

  • Daud, A., Aljohani, N. R., Abbasi, R. A., Rafique, Z., Amjad, T., Dawood, H., & Alyoubi, K. H. (2017). Finding rising stars in co-author networks via weighted mutual influence. Proceedings of the 26th International Conference on World Wide Web Companion, 33–41.

  • Daud, A., Abbasi, R., & Muhammad, F. (2013). Finding rising stars in social networks. In W. Meng, L. Feng, S. Bressan, W. Winiwarter, & W. Song (Eds.), International conference on database systems for advanced applications (pp. 13–24). Springer.

    Chapter  Google Scholar 

  • Daud, A., Ahmad, M., Malik, M. S. I., & Che, D. (2015). Using machine learning techniques for rising star prediction in co-author network. Scientometrics, 102(2), 1687–1711.

    Article  Google Scholar 

  • Davletov, F., Aydin, A. S., & Cakmak, A. (2014). High impact academic paper prediction using temporal and topological features. Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 491–498.

  • de Abreu Batista-Jr, A., Gouveia, F. C., & Mena-Chalco, J. P. (2021). Predicting the Q of junior researchers using data from the first years of publication. Journal of Informetrics, 15(2), 101130.

    Article  Google Scholar 

  • Dey, R., Roy, A., Chakraborty, T., & Ghosh, S. (2017). Sleeping beauties in computer science: Characterization and early identification. Scientometrics, 113(3), 1645–1663.

    Article  Google Scholar 

  • Ding, Y., Zhang, G., Chambers, T., Song, M., Wang, X., & Zhai, C. (2014). Content-based citation analysis: The next generation of citation analysis. Journal of the Association for Information Science and Technology, 65(9), 1820–1833.

    Article  Google Scholar 

  • Dong, Y., Johnson, R. A., & Chawla, N. V. (2016). Can scientific impact be predicted? IEEE Transactions on Big Data, 2(1), 18–30.

    Article  Google Scholar 

  • Drongstrup, D., Malik, S., Aljohani, N. R., Alelyani, S., Safder, I., & Hassan, S.-U. (2020). Can social media usage of scientific literature predict journal indices of AJG, SNIP and JCR? An altmetric study of Economics. Scientometrics, 125(2), 1541–1558.

    Article  Google Scholar 

  • Du, W., Xie, Z., & Lv, Y. (2021). Predicting publication productivity for authors: Shallow or deep architecture? Scientometrics, 126(7), 5855–5879.

    Article  Google Scholar 

  • Fong, E. A., & Wilhite, A. W. (2017). Authorship and citation manipulation in academic research. PLoS ONE, 12(12), e0187394.

    Article  Google Scholar 

  • Fortunato, S., Bergstrom, C. T., Börner, K., Evans, J. A., Helbing, D., Milojević, S., Petersen, A. M., Radicchi, F., Sinatra, R., & Uzzi, B. (2018). Science of science. Science, 359(6379), eaao0185.

    Article  Google Scholar 

  • Fronzetti Colladon, A., D’Angelo, C. A., & Gloor, P. A. (2020). Predicting the future success of scientific publications through social network and semantic analysis. Scientometrics, 124(1), 357–377.

    Article  Google Scholar 

  • Fu, L., & Aliferis, C. (2010). Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature. Scientometrics, 85(1), 257–270.

    Article  Google Scholar 

  • García-Pérez, M. A. (2013). Limited validity of equations to predict the future h index. Scientometrics, 96(3), 901–909.

    Article  Google Scholar 

  • Gardfield, E. (1977). The 250 “most-cited primary authors, 1961–1975” Part II: The correlation between citedness, noble prizes and academy memberships. Current Comments, 50, 5–15.

    Google Scholar 

  • Gingras, Y., & Wallace, M. (2010). Why it has become more difficult to predict Nobel Prize winners: A bibliometric analysis of nominees and winners of the chemistry and physics prizes (1901–2007). Scientometrics, 82(2), 401–412.

    Article  Google Scholar 

  • Giuffrida, C., Abramo, G., & D’Angelo, C. A. (2019). Are all citations worth the same? Valuing citations by the value of the citing items. Journal of Informetrics, 13(2), 500–514.

    Article  Google Scholar 

  • Ha, L., Jiang, W., Bi, C., Zhang, R., Zhang, T., & Wen, X. (2016). How online usage of subscription-based journalism and mass communication research journal articles predicts citations. Learned Publishing, 29(3), 183–192.

    Article  Google Scholar 

  • Haslam, N., Ban, L., Kaufmann, L., Loughnan, S., Peters, K., Whelan, J., & Wilson, S. (2008). What makes an article influential? Predicting impact in social and personality psychology. Scientometrics, 76(1), 169–185.

    Article  Google Scholar 

  • Hirsch, J. E. (2007). Does the h index have predictive power? Proceedings of the National Academy of Sciences, 104(49), 19193–19198.

    Article  Google Scholar 

  • Holm, A. N., Plank, B., Wright, D., & Augenstein, I. (2020). Longitudinal citation prediction using temporal graph neural networks. ArXiv Preprint ArXiv: 2012.05742.

  • Hou, J., Pan, H., Guo, T., Lee, I., Kong, X., & Xia, F. (2019). Prediction methods and applications in the science of science: A survey. Computer Science Review, 34, 100197.

    Article  Google Scholar 

  • Hu, Y.-H., Tai, C.-T., Liu, K. E., & Cai, C.-F. (2020). Identification of highly-cited papers using topic-model-based and bibliometric features: The consideration of keyword popularity. Journal of Informetrics, 14(1), 101004.

    Article  Google Scholar 

  • Ibáñez, A., Larrañaga, P., & Bielza, C. (2011). Predicting the h-index with cost-sensitive naive Bayes. 2011 11th International Conference on Intelligent Systems Design and Applications, 599–604.

  • Jensen, P., Rouquier, J.-B., & Croissant, Y. (2009). Testing bibliometric indicators by their prediction of scientists promotions. Scientometrics, 78(3), 467–479.

    Article  Google Scholar 

  • Jiang, S., Koch, B., & Sun, Y. (2021). HINTS: Citation time series prediction for new publications via dynamic heterogeneous information network embedding. Proceedings of the Web Conference, 2021, 3158–3167.

    Google Scholar 

  • Kanellos, I., Vergoulis, T., Sacharidis, D., Dalamagas, T., & Vassiliou, Y. (2021). Ranking papers by their short-term scientific impact. 2021 IEEE 37th International Conference on Data Engineering (ICDE), 1997–2002.

  • Ke, Q., Ferrara, E., Radicchi, F., & Flammini, A. (2015). Defining and identifying sleeping beauties in science. Proceedings of the National Academy of Sciences, 112(24), 7426–7431.

    Article  Google Scholar 

  • Klemiński, R., Kazienko, P., & Kajdanowicz, T. (2021). Where should I publish? Heterogeneous, networks-based prediction of paper’s citation success. Journal of Informetrics, 15(3), 101200.

    Article  Google Scholar 

  • Klimek, P., Jovanovic, S., Egloff, A., & Schneider, R. (2016). Successful fish go with the flow: Citation impact prediction based on centrality measures for term–document networks. Scientometrics, 107(3), 1265–1282.

    Article  Google Scholar 

  • Kong, X., Zhang, J., Zhang, D., Bu, Y., Ding, Y., & Xia, F. (2020). The gene of scientific success. ACM Transactions on Knowledge Discovery from Data (TKDD), 14(4), 1–19.

    Article  Google Scholar 

  • Laurance, W. F., Useche, D. C., Laurance, S. G., & Bradshaw, C. J. (2013). Predicting publication success for biologists. BioScience, 63(10), 817–823.

    Article  Google Scholar 

  • Lee, D. H. (2019). Predicting the research performance of early career scientists. Scientometrics, 121(3), 1481–1504.

    Article  Google Scholar 

  • Levitt, J. M., & Thelwall, M. (2011). A combined bibliometric indicator to predict article impact. Information Processing & Management, 47(2), 300–308.

    Article  Google Scholar 

  • Li, X.-L., Foo, C. S., Tew, K. L., & Ng, S.-K. (2009). Searching for rising stars in bibliography networks. International Conference on Database Systems for Advanced Applications, 288–292.

  • Li, C.-T., Lin, Y.-J., Yan, R., & Yeh, M.-Y. (2015). Trend-based citation count prediction for research articles. Pacific-Asia Conference on Knowledge Discovery and Data Mining, 659–671.

  • Li, L., & Tong, H. (2015). The child is father of the man: Foresee the success at the early stage. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 655–664.

  • Li, S., Zhao, W. X., Yin, E. J., & Wen, J.-R. (2019a). A neural citation count prediction model based on peer review text. Proceedings of the 2019a Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 4914–4924.

  • Li, W., Aste, T., Caccioli, F., & Livan, G. (2019b). Early coauthorship with top scientists predicts success in academic careers. Nature Communications, 10(1), 1–9.

    Article  Google Scholar 

  • Lindahl, J. (2018). Predicting research excellence at the individual level: The importance of publication rate, top journal publications, and top 10% publications in the case of early career mathematicians. Journal of Informetrics, 12(2), 518–533.

    Article  Google Scholar 

  • Lindahl, J., Colliander, C., & Danell, R. (2020). Early career performance and its correlation with gender and publication output during doctoral education. Scientometrics, 122(1), 309–330.

    Article  Google Scholar 

  • Liu, L., Yu, D., Wang, D., & Fukumoto, F. (2020). Citation count prediction based on neural hawkes model. IEICE Transactions on Information and Systems, 103(11), 2379–2388.

    Article  Google Scholar 

  • Livne, A., Adar, E., Teevan, J., & Dumais, S. (2013). Predicting citation counts using text and graph mining. Proc. the IConference 2013 Workshop on Computational Scientometrics: Theory and Applications, 1–4.

  • Luo, Z., He, J., Qian, J., Wang, Y., Chen, J., & Lu, W. (2020). Can scientific publication’s network structural features predict its citation? Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in, 2020, 485–486.

    Google Scholar 

  • Ma, A., Liu, Y., Xu, X., & Dong, T. (2021). A deep-learning based citation count prediction model with paper metadata semantic features. Scientometrics, 126(8), 6803–6823.

    Article  Google Scholar 

  • Ma, Y., & Uzzi, B. (2018). Scientific prize network predicts who pushes the boundaries of science. Proceedings of the National Academy of Sciences, 115(50), 12608–12615.

    Article  Google Scholar 

  • Mahalakshmi, G. S., Sendhilkumar, S., Jancy, P., & Easwarakumar, K. S. (2020). A Neural Learning Approach for Prediction of Research Citations Using Article Semantics. 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS), 816–819.

  • Mazloumian, A. (2012). Predicting scholars’ scientific impact. PLoS ONE, 7(11), e49246.

    Article  Google Scholar 

  • Mistele, T., Price, T., & Hossenfelder, S. (2019). Predicting authors’ citation counts and h-indices with a neural network. Scientometrics, 120(1), 87–104.

    Article  Google Scholar 

  • Nezhadbiglari, M., Gonçalves, M. A., & Almeida, J. M. (2016). Early prediction of scholar popularity. Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 181–190.

  • Nie, Y., Zhu, Y., Lin, Q., Zhang, S., Shi, P., & Niu, Z. (2019). Academic rising star prediction via scholar’s evaluation model and machine learning techniques. Scientometrics, 120(2), 461–476.

    Article  Google Scholar 

  • Onodera, N., & Yoshikane, F. (2015). Factors affecting citation rates of research articles. Journal of the Association for Information Science and Technology, 66(4), 739–764.

    Article  Google Scholar 

  • Panagopoulos, G., Tsatsaronis, G., & Varlamis, I. (2017). Detecting rising stars in dynamic collaborative networks. Journal of Informetrics, 11(1), 198–222.

    Article  Google Scholar 

  • Park, H.-M., Sinshaw, Y. B., & Sohn, K.-A. (2017). Temporal citation network-based feature extraction for cited count prediction. International Conference on Mobile and Wireless Technology, 380–388.

  • Penner, O., Pan, R. K., Petersen, A. M., Kaski, K., & Fortunato, S. (2013). On the predictability of future impact in science. Scientific Reports, 3(1), 1–8.

    Article  Google Scholar 

  • Pobiedina, N., & Ichise, R. (2016). Citation count prediction as a link prediction problem. Applied Intelligence, 44(2), 252–268.

    Article  Google Scholar 

  • Põder, E. (2017). A framework for the measurement and prediction of an individual scientist’s performance. Trames, 21(1), 3–14.

    Article  Google Scholar 

  • Porwal, P., & Devare, M. H. (2020). Citation Classification Prediction Implying Text Features Using Natural Language Processing and Supervised Machine Learning Algorithms. International Conference on Recent Trends in Image Processing and Pattern Recognition, 540–552.

  • Qian, Y., Dong, Y., Ma, Y., Jin, H., & Li, J. (2016). Feature engineering and ensemble modeling for paper acceptance rank prediction. ArXiv Preprint ArXiv: 1611.04369.

  • Rokach, L., Kalech, M., Blank, I., & Stern, R. (2011). Who is going to win the next association for the advancement of artificial intelligence fellowship award? Evaluating researchers by mining bibliographic data. Journal of the American Society for Information Science and Technology, 62(12), 2456–2470.

    Article  Google Scholar 

  • Ruan, X., Zhu, Y., Li, J., & Cheng, Y. (2020). Predicting the citation counts of individual papers via a BP neural network. Journal of Informetrics, 14(3), 101039.

    Article  Google Scholar 

  • Sandulescu, V., & Chiru, M. (2016). Predicting the future relevance of research institutions-The winning solution of the KDD Cup 2016. ArXiv Preprint ArXiv: 1609.02728.

  • Sayyadi, H., & Getoor, L. (2009). Futurerank: Ranking scientific articles by predicting their future pagerank. Proceedings of the 2009 SIAM International Conference on Data Mining, 533–544.

  • Schreiber, M. (2013). How relevant is the predictive power of the h-index? A case study of the time-dependent Hirsch index. Journal of Informetrics, 7(2), 325–329.

    Article  Google Scholar 

  • Shen, H., Wang, D., Song, C., & Barabási, A.-L. (2014). Modeling and predicting popularity dynamics via reinforced poisson processes. Proceedings of the AAAI Conference on Artificial Intelligence. https://doi.org/10.1609/aaai.v28i1.8739

    Article  Google Scholar 

  • Shuang, Q. (2016). Heterogenous Graph Mining for Measuring the Impact of Research Institutions.

  • Sinatra, R., Wang, D., Deville, P., Song, C., & Barabási, A.-L. (2016). Quantifying the evolution of individual scientific impact. Science, 354(6312), aaf5239.

    Article  Google Scholar 

  • Singh, M., Patidar, V., Kumar, S., Chakraborty, T., Mukherjee, A., & Goyal, P. (2015). The role of citation context in predicting long-term citation profiles: An experimental study based on a massive bibliographic text dataset. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, 1271–1280.

  • Skarding, J., Gabrys, B., & Musial, K. (2021). Foundations and modeling of dynamic networks using dynamic graph neural networks: A survey. IEEE Access, 9, 79143–79168.

    Article  Google Scholar 

  • Sohrabi, B., & Iraj, H. (2017). The effect of keyword repetition in abstract and keyword frequency per journal in predicting citation counts. Scientometrics, 110(1), 243–251.

    Article  Google Scholar 

  • Stegehuis, C., Litvak, N., & Waltman, L. (2015). Predicting the long-term citation impact of recent publications. Journal of Informetrics, 9(3), 642–657.

    Article  Google Scholar 

  • Stern, D. I. (2014). High-ranked social science journal articles can be identified from early citation information. PLoS ONE, 9(11), e112520. https://doi.org/10.1371/journal.pone.0112520

    Article  Google Scholar 

  • Tahamtan, I., Safipour Afshar, A., & Ahamdzadeh, K. (2016). Factors affecting number of citations: A comprehensive review of the literature. Scientometrics, 107(3), 1195–1225.

    Article  Google Scholar 

  • Thelwall, M., & Nevill, T. (2018). Could scientists use Altmetric: Com scores to predict longer term citation counts? Journal of Informetrics, 12(1), 237–248.

    Article  Google Scholar 

  • Timilsina, M., Davis, B., Taylor, M., & Hayes, C. (2016). Towards predicting academic impact from mainstream news and weblogs: A heterogeneous graph based approach. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2016, 1388–1389.

    Google Scholar 

  • Valderrama, P., Escabias, M., Jiménez-Contreras, E., Valderrama, M. J., & Baca, P. (2018). A mixed longitudinal and cross-sectional model to forecast the journal impact factor in the field of Dentistry. Scientometrics, 116(2), 1203–1212.

    Article  Google Scholar 

  • Van Dijk, D., Manor, O., & Carey, L. B. (2014). Publication metrics and success on the academic job market. Current Biology, 24(11), R516–R517.

    Article  Google Scholar 

  • Van Noorden, R. (2017). The science That’s. Nature, 552, 162–164.

    Article  Google Scholar 

  • Van Raan, A. F. (2004). Sleeping beauties in science. Scientometrics, 59(3), 467–472.

    Article  Google Scholar 

  • Walker, D., Xie, H., Yan, K.-K., & Maslov, S. (2007). Ranking scientific publications using a model of network traffic. Journal of Statistical Mechanics: Theory and Experiment, 2007(06), P06010.

    Article  Google Scholar 

  • Wan, X., & Liu, F. (2014). Are all literature citations equally important? Automatic citation strength estimation and its applications. Journal of the Association for Information Science and Technology, 65(9), 1929–1938.

    Article  Google Scholar 

  • Wang, S., Xie, S., Zhang, X., Li, Z., Yu, P. S., & Shu, X. (2014). Future influence ranking of scientific literature. Proceedings of the 2014 SIAM International Conference on Data Mining, 749–757.

  • Wang, D., Song, C., & Barabási, A.-L. (2013). Quantifying long-term scientific impact. Science, 342(6154), 127–132.

    Article  Google Scholar 

  • Wang, F., Fan, Y., Zeng, A., & Di, Z. (2019a). Can we predict ESI highly cited publications? Scientometrics, 118(1), 109–125.

    Article  Google Scholar 

  • Wang, M., Jiao, S., Chai, K.-H., & Chen, G. (2019b). Building journal’s long-term impact: Using indicators detected from the sustained active articles. Scientometrics, 121(1), 261–283.

    Article  Google Scholar 

  • Wang, M., Wang, Z., & Chen, G. (2019c). Which can better predict the future success of articles? Bibliometric indices or alternative metrics. Scientometrics, 119(3), 1575–1595.

    Article  Google Scholar 

  • Wang, J., Zhang, F., Li, Y., & Liu, D. (2020). Attention-based multi-fusion method for citation prediction. In J.-S. Pan, J. Li, P.-W. Tsai, & L. C. Jain (Eds.), Advances in intelligent information hiding and multimedia signal processing (pp. 315–322). Springer.

    Chapter  Google Scholar 

  • Wang, K., Shi, W., Bai, J., Zhao, X., & Zhang, L. (2021a). Prediction and application of article potential citations based on nonlinear citation-forecasting combined model. Scientometrics, 126(8), 6533–6550.

    Article  Google Scholar 

  • Wang, W., Zhang, J., Zhou, F., Chen, P., & Wang, B. (2021b). Paper acceptance prediction at the institutional level based on the combination of individual and network features. Scientometrics, 126(2), 1581–1597.

    Article  Google Scholar 

  • Wang, M., Yu, G., Xu, J., He, H., Yu, D., & An, S. (2012). Development a case-based classifier for predicting highly cited papers. Journal of Informetrics, 6(4), 586–599.

    Article  Google Scholar 

  • Wang, M., Yu, G., & Yu, D. (2011). Mining typical features for highly cited papers. Scientometrics, 87(3), 695–706.

    Article  Google Scholar 

  • Wang, S., Xie, S., Zhang, X., Li, Z., Yu, P. S., & He, Y. (2016). Coranking the future influence of multiobjects in bibliographic network through mutual reinforcement. ACM Transactions on Intelligent Systems and Technology (TIST), 7(4), 1–28.

    Article  Google Scholar 

  • Way, S. F., Morgan, A. C., Clauset, A., & Larremore, D. B. (2017). The misleading narrative of the canonical faculty productivity trajectory. Proceedings of the National Academy of Sciences, 114(44), E9216–E9223.

    Article  Google Scholar 

  • Way, S. F., Morgan, A. C., Larremore, D. B., & Clauset, A. (2019). Productivity, prominence, and the effects of academic environment. Proceedings of the National Academy of Sciences, 116(22), 10729–10733.

    Article  Google Scholar 

  • Weihs, L., & Etzioni, O. (2017). Learning to predict citation-based impact measures. ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2017, 1–10.

    Google Scholar 

  • Weis, J. W., & Jacobson, J. M. (2021). Learning on knowledge graph dynamics provides an early warning of impactful research. Nature Biotechnology, 39(10), 1300–1307.

    Article  Google Scholar 

  • Wen, J., Wu, L., & Chai, J. (2020). Paper citation count prediction based on recurrent neural network with gated recurrent unit. 2020 IEEE 10th International Conference on Electronics Information and Emergency Communication (ICEIEC), 303–306.

  • Wilson, J., Mohan, R., Arif, M., Chaudhury, S., & Lall, B. (2016). Ranking academic institutions on potential paper acceptance in upcoming conferences. ArXiv Preprint ArXiv: 1610.02828.

  • Wu, X., Fu, Q., & Rousseau, R. (2008). On indexing in the Web of Science and predicting journal impact factor. Journal of Zhejiang University Science B, 9(7), 582–590.

    Article  Google Scholar 

  • Wu, Z., Lin, W., Liu, P., Chen, J., & Mao, L. (2019). Predicting long-term scientific impact based on multi-field feature extraction. IEEE Access, 7, 51759–51770.

    Article  Google Scholar 

  • Xia, F., Wang, W., Bekele, T. M., & Liu, H. (2017). Big scholarly data: A survey. IEEE Transactions on Big Data, 3(1), 18–35.

    Article  Google Scholar 

  • Xiao, C., Han, J., Fan, W., Wang, S., Huang, R., & Zhang, Y. (2019). Predicting scientific impact via heterogeneous academic network embedding. Pacific Rim International Conference on Artificial Intelligence, 555–568.

  • Xiao, C., Sun, L., Han, J., & Qiao, Y. (2021). Heterogeneous academic network embedding based multivariate random-walk model for predicting scientific impact. Applied Intelligence, 1–18.

  • Xie, Z. (2020). Predicting publication productivity for researchers: A piecewise Poisson model. Journal of Informetrics, 14(3), 101065.

    Article  Google Scholar 

  • Xu, J., Li, M., Jiang, J., Ge, B., & Cai, M. (2019). Early prediction of scientific impact based on multi-bibliographic features and convolutional neural network. IEEE Access, 7, 92248–92258.

    Article  Google Scholar 

  • Yan, R., Tang, J., Liu, X., Shan, D., & Li, X. (2011). Citation count prediction: Learning to estimate future citations for literature. Proceedings of the 20th ACM International Conference on Information and Knowledge Management, 1247–1252.

  • Yu, T., Yu, G., Li, P.-Y., & Wang, L. (2014). Citation impact prediction for scientific papers using stepwise regression analysis. Scientometrics, 101(2), 1233–1252.

    Article  Google Scholar 

  • Yu, X., Szymanski, B. K., & Jia, T. (2021). Become a better you: Correlation between the change of research direction and the change of scientific performance. Journal of Informetrics, 15(3), 101193.

    Article  Google Scholar 

  • Yuan, S., Tang, J., Zhang, Y., Wang, Y., & Xiao, T. (2018). Modeling and predicting citation count via recurrent neural network with long short-term memory. ArXiv Preprint ArXiv: 1811.02129.

  • Zhang, C., Liu, C., Yu, L., Zhang, Z.-K., & Zhou, T. (2017). Identifying the academic rising stars via pairwise citation increment ranking. Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data, 475–483.

  • Zhang, F., Bai, X., & Lee, I. (2019). Author impact: Evaluations, predictions, and challenges. IEEE Access, 7, 38657–38669.

    Article  Google Scholar 

  • Zhang, F., & Wu, S. (2020). Predicting future influence of papers, researchers, and venues in a dynamic academic network. Journal of Informetrics, 14(2), 101035.

    Article  Google Scholar 

  • Zhang, F., & Wu, S. (2021). Measuring academic entities’ impact by content-based citation analysis in a heterogeneous academic network. Scientometrics, 126(8), 7197–7222.

    Article  Google Scholar 

  • Zhang, J., Ning, Z., Bai, X., Wang, W., Yu, S., & Xia, F. (2016a). Who are the rising stars in academia? IEEE/ACM Joint Conference on Digital Libraries (JCDL), 2016, 211–212.

    Article  Google Scholar 

  • Zhang, J., Xia, F., Wang, W., Bai, X., Yu, S., Bekele, T. M., & Peng, Z. (2016b). Cocarank: A collaboration caliber-based method for finding academic rising stars. Proceedings of the 25th International Conference Companion on World Wide Web, 395–400.

  • Zhang, J., Xu, B., Liu, J., Tolba, A., Al-Makhadmeh, Z., & Xia, F. (2018a). PePSI: Personalized prediction of scholars’ impact in heterogeneous temporal academic networks. IEEE Access, 6, 55661–55672.

    Article  Google Scholar 

  • Zhang, L., Xie, Y., Xidao, L., & Zhang, X. (2018b). Multi-source heterogeneous data fusion. International Conference on Artificial Intelligence and Big Data (ICAIBD), 2018, 47–51.

    Google Scholar 

  • Zhang, X., Fu, L., & Wang, X. (2018c). Ranking the Future Influence of Scientific Literatures. 2018c IEEE 4th International Conference on Computer and Communications (ICCC), 2362–2371.

  • Zhang, J., & Yu, P. S. (2018). Broad learning: An emerging area in social network analysis. ACM SIGKDD Explorations Newsletter, 20(1), 24–50.

    Article  Google Scholar 

  • Zheng, Y. (2015). Methodologies for cross-domain data fusion: An overview. IEEE Transactions on Big Data, 1(1), 16–34.

    Article  Google Scholar 

  • Zhou, F., Xu, X., Li, C., Trajcevski, G., Zhong, T., & Zhang, K. (2020a). A heterogeneous dynamical graph neural networks approach to quantify scientific impact. ArXiv Preprint ArXiv: 2003.12042.

  • Zhou, Y., Cheng, H., Li, Q., & Wang, W. (2020b). Diversity of temporal influence in popularity prediction of scientific publications. Scientometrics, 123(1), 383–392.

    Article  Google Scholar 

  • Zhou, Y., Wang, R., Zeng, A., & Zhang, Y.-C. (2020c). Identifying prize-winning scientists by a competition-aware ranking. Journal of Informetrics, 14(3), 101038.

    Article  Google Scholar 

  • Zhou, W., Gu, J., & Jia, Y. (2018). H-Index-based link prediction methods in citation network. Scientometrics, 117(1), 381–390.

    Article  Google Scholar 

  • Zhou, Y., Li, Q., Yang, X., & Cheng, H. (2021). Predicting the popularity of scientific publications by an age-based diffusion model. Journal of Informetrics, 15(4), 101177.

    Article  Google Scholar 

  • Zoller, D., Doerfel, S., Jäschke, R., Stumme, G., & Hotho, A. (2016). Posted, visited, exported: Altmetrics in the social tagging system BibSonomy. Journal of Informetrics, 10(3), 732–749.

    Article  Google Scholar 

  • Zuo, Z., & Zhao, K. (2021). Understanding and predicting future research impact at different career stages—A social network perspective. Journal of the Association for Information Science and Technology, 72(4), 454–472.

    Article  Google Scholar 

Download references

Funding

This research is partially supported by the National Natural Science Foundation of China (Grant Nos. 62202395 and 62176221), Sichuan Academic Achievement Analysis and Application Research Center (Grant No. XSCG2021-021), Natural Science Foundation of Sichuan Province (Grant No. 2022NSFSC0930), Fundamental Research Funds for the Central Universities (Grant No. 2682022CX067) and Youth Talent Startup Grant of SWJTU-Leeds Joint School.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chongshou Li.

Ethics declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xia, W., Li, T. & Li, C. A review of scientific impact prediction: tasks, features and methods. Scientometrics 128, 543–585 (2023). https://doi.org/10.1007/s11192-022-04547-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-022-04547-8

Keywords

Navigation