A review of scientific impact prediction: tasks, features and methods

Xia, Wanjun; Li, Tianrui; Li, Chongshou

doi:10.1007/s11192-022-04547-8

A review of scientific impact prediction: tasks, features and methods

Published: 26 November 2022

Volume 128, pages 543–585, (2023)
Cite this article

Scientometrics Aims and scope Submit manuscript

1368 Accesses
4 Citations
1 Altmetric
Explore all metrics

Abstract

With the rapid evolution of scientific research, there are a huge volume of papers published every year and the number of scholars is also growing fast. How to effectively predict the scientific impact has become an important research problem, attracting the attention of researchers in various fields, and it is of great significance in improving research efficiency and assisting in decision-making and scientific evaluation. In this paper, we propose a new framework to perform a systematical survey of scientific impact prediction research. Specifically, we take the four common academic entities into account: papers, scholars, venues and institutions. We reviewed all the prediction tasks reported in the literature in detail; the input features are divided into six groups: paper-related, author-related, venue-related, institution-related, network-related and altmetrics-related. Moreover, we classify the forecasting methods into mathematical statistics-based, traditional machine learning-based, deep learning-based and graph-based, and subdivide each category according to the characteristics. Finally, we discuss open issues and existing challenges, and provide potential research directions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Predicting the Evolution of Scientific Output

Predicting High Impact Academic Papers Using Citation Network Features

Research trend prediction in computer science publications: a deep neural network approach

Article 20 January 2022

Notes

References

Abbasi, A., Altmann, J., & Hossain, L. (2011). Identifying the effects of co-authorship networks on the performance of scholars: A correlation and regression analysis of performance measures and social network analysis measures. Journal of Informetrics, 5(4), 594–607.
Article Google Scholar
Abramo, G., D’Angelo, C. A., & Felici, G. (2019a). Predicting publication long-term impact through a combination of early citations and journal impact factor. Journal of Informetrics, 13(1), 32–49.
Article Google Scholar
Abramo, G., D’Angelo, C. A., & Reale, E. (2019b). Peer review versus bibliometrics: Which method better predicts the scholarly impact of publications? Scientometrics, 121(1), 537–554.
Article Google Scholar
Abrishami, A., & Aliakbary, S. (2019). Predicting citation counts based on deep neural network learning techniques. Journal of Informetrics, 13(2), 485–499.
Article Google Scholar
Acuna, D. E., Allesina, S., & Kording, K. P. (2012). Predicting scientific success. Nature, 489(7415), 201–202.
Article Google Scholar
Akella, A. P., Alhoori, H., Kondamudi, P. R., Freeman, C., & Zhou, H. (2021). Early indicators of scientific impact: Predicting citations with altmetrics. Journal of Informetrics, 15(2), 101128.
Article Google Scholar
Ashton, S. V., & Oppenheim, C. (1978). A method of predicting Nobel prizewinners in chemistry. Social Studies of Science, 8(3), 341–348.
Article Google Scholar
Ayaz, S., Masood, N., & Islam, M. A. (2018). Predicting scientific impact based on h-index. Scientometrics, 114(3), 993–1010.
Article Google Scholar
Bai, X., Liu, H., Zhang, F., Ning, Z., Kong, X., Lee, I., & Xia, F. (2017a). An overview on evaluating and predicting scholarly article impact. Information, 8(3), 73.
Article Google Scholar
Bai, X., Zhang, F., Hou, J., Xia, F., Tolba, A., & Elashkar, E. (2017b). Implicit multi-feature learning for dynamic time series prediction of the impact of institutions. IEEE Access, 5, 16372–16382.
Article Google Scholar
Bai, X., Pan, H., Hou, J., Guo, T., Lee, I., & Xia, F. (2020). Quantifying success in science: An overview. IEEE Access, 8, 123200–123214.
Article Google Scholar
Bai, X., Zhang, F., & Lee, I. (2019). Predicting the citations of scholarly paper. Journal of Informetrics, 13(1), 407–418.
Article Google Scholar
Bento, C., Martins, B., & Calado, P. (2013). Predicting the Future Impact of Academic Publications. In L. Correia, L. P. Reis, & J. Cascalho (Eds.), Portuguese Conference on Artificial Intelligence (pp. 366–377). Springer.
Google Scholar
Bertsimas, D., Brynjolfsson, E., Reichman, S., & Silberhoz, J. (2013). Network analysis for predicting academic impact. Proceedings of the 34th International Conference on Information Systems (ICIS), 92.
Bhat, H. S., Huang, L.-H., Rodriguez, S., Dale, R., & Heit, E. (2015). Citation prediction using diverse features. IEEE International Conference on Data Mining Workshop (ICDMW), 2015, 589–596.
Article Google Scholar
Bornmann, L., & Daniel, H.-D. (2010). Citation speed as a measure to predict the attention an article receives: An investigation of the validity of editorial decisions at Angewandte Chemie International Edition. Journal of Informetrics, 4(1), 83–88.
Article Google Scholar
Bornmann, L., Leydesdorff, L., & Wang, J. (2014). How to improve the prediction based on citation impact percentiles for years shortly after the publication date? Journal of Informetrics, 8(1), 175–180.
Article Google Scholar
Brizan, D. G., Gallagher, K., Jahangir, A., & Brown, T. (2016). Predicting citation patterns: Defining and determining influence. Scientometrics, 108(1), 183–200.
Article Google Scholar
Brody, T., Harnad, S., & Carr, L. (2006). Earlier web usage statistics as predictors of later citation impact. Journal of the American Society for Information Science and Technology, 57(8), 1060–1072.
Article Google Scholar
Bütün, E., & Kaya, M. (2019). Predicting citation count of scientists as a link prediction problem. IEEE Transactions on Cybernetics, 50(10), 4518–4529.
Article Google Scholar
Cao, X., Chen, Y., & Liu, K. R. (2016). A data analytic approach to quantifying scientific impact. Journal of Informetrics, 10(2), 471–484.
Article Google Scholar
Chakraborty, T., Kumar, S., Goyal, P., Ganguly, N., & Mukherjee, A. (2014). Towards a stratified learning approach to predict future citation counts. IEEE/ACM Joint Conference on Digital Libraries, 351–360.
Chawla, D. S. (2021). Frosty reception for algorithm that predicts research papers’ impact. Nature.
Cheang, B., Chu, S. K. W., Li, C., & Lim, A. (2014a). A multidimensional approach to evaluating management journals: Refining PageRank via the differentiation of citation types and identifying the roles that management journals play. Journal of the Association for Information Science and Technology, 65(12), 2581–2591.
Article Google Scholar
Cheang, B., Chu, S. K. W., Li, C., & Lim, A. (2014b). OR/MS journals evaluation based on a refined PageRank method: An updated and more comprehensive review. Scientometrics, 100(2), 339–361.
Article Google Scholar
Cheang, B., Li, C., Lim, A., & Zhang, Z. (2015). Identifying patterns and structural influences in the scientific communication of business knowledge. Scientometrics, 103(1), 159–189.
Article Google Scholar
Chen, C. (2012). Predictive effects of structural variation on citation counts. Journal of the American Society for Information Science and Technology, 63(3), 431–449.
Article Google Scholar
Cui, P., Shen, Z., Li, S., Yao, L., Li, Y., Chu, Z., & Gao, J. (2020). Causal inference meets machine learning. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 3527–3528.
Cummings, D., & Nassar, M. (2020). Structured citation trend prediction using graph neural networks. ICASSP 2020–2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 3897–3901.
Danell, R. (2011). Can the quality of scientific work be predicted using information on the author’s track record? Journal of the American Society for Information Science and Technology, 62(1), 50–60.
Article Google Scholar
Daud, A., Aljohani, N. R., Abbasi, R. A., Rafique, Z., Amjad, T., Dawood, H., & Alyoubi, K. H. (2017). Finding rising stars in co-author networks via weighted mutual influence. Proceedings of the 26th International Conference on World Wide Web Companion, 33–41.
Daud, A., Abbasi, R., & Muhammad, F. (2013). Finding rising stars in social networks. In W. Meng, L. Feng, S. Bressan, W. Winiwarter, & W. Song (Eds.), International conference on database systems for advanced applications (pp. 13–24). Springer.
Chapter Google Scholar
Daud, A., Ahmad, M., Malik, M. S. I., & Che, D. (2015). Using machine learning techniques for rising star prediction in co-author network. Scientometrics, 102(2), 1687–1711.
Article Google Scholar
Davletov, F., Aydin, A. S., & Cakmak, A. (2014). High impact academic paper prediction using temporal and topological features. Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 491–498.
de Abreu Batista-Jr, A., Gouveia, F. C., & Mena-Chalco, J. P. (2021). Predicting the Q of junior researchers using data from the first years of publication. Journal of Informetrics, 15(2), 101130.
Article Google Scholar
Dey, R., Roy, A., Chakraborty, T., & Ghosh, S. (2017). Sleeping beauties in computer science: Characterization and early identification. Scientometrics, 113(3), 1645–1663.
Article Google Scholar
Ding, Y., Zhang, G., Chambers, T., Song, M., Wang, X., & Zhai, C. (2014). Content-based citation analysis: The next generation of citation analysis. Journal of the Association for Information Science and Technology, 65(9), 1820–1833.
Article Google Scholar
Dong, Y., Johnson, R. A., & Chawla, N. V. (2016). Can scientific impact be predicted? IEEE Transactions on Big Data, 2(1), 18–30.
Article Google Scholar
Drongstrup, D., Malik, S., Aljohani, N. R., Alelyani, S., Safder, I., & Hassan, S.-U. (2020). Can social media usage of scientific literature predict journal indices of AJG, SNIP and JCR? An altmetric study of Economics. Scientometrics, 125(2), 1541–1558.
Article Google Scholar
Du, W., Xie, Z., & Lv, Y. (2021). Predicting publication productivity for authors: Shallow or deep architecture? Scientometrics, 126(7), 5855–5879.
Article Google Scholar
Fong, E. A., & Wilhite, A. W. (2017). Authorship and citation manipulation in academic research. PLoS ONE, 12(12), e0187394.
Article Google Scholar
Fortunato, S., Bergstrom, C. T., Börner, K., Evans, J. A., Helbing, D., Milojević, S., Petersen, A. M., Radicchi, F., Sinatra, R., & Uzzi, B. (2018). Science of science. Science, 359(6379), eaao0185.
Article Google Scholar
Fronzetti Colladon, A., D’Angelo, C. A., & Gloor, P. A. (2020). Predicting the future success of scientific publications through social network and semantic analysis. Scientometrics, 124(1), 357–377.
Article Google Scholar
Fu, L., & Aliferis, C. (2010). Using content-based and bibliometric features for machine learning models to predict citation counts in the biomedical literature. Scientometrics, 85(1), 257–270.
Article Google Scholar
García-Pérez, M. A. (2013). Limited validity of equations to predict the future h index. Scientometrics, 96(3), 901–909.
Article Google Scholar
Gardfield, E. (1977). The 250 “most-cited primary authors, 1961–1975” Part II: The correlation between citedness, noble prizes and academy memberships. Current Comments, 50, 5–15.
Google Scholar
Gingras, Y., & Wallace, M. (2010). Why it has become more difficult to predict Nobel Prize winners: A bibliometric analysis of nominees and winners of the chemistry and physics prizes (1901–2007). Scientometrics, 82(2), 401–412.
Article Google Scholar
Giuffrida, C., Abramo, G., & D’Angelo, C. A. (2019). Are all citations worth the same? Valuing citations by the value of the citing items. Journal of Informetrics, 13(2), 500–514.
Article Google Scholar
Ha, L., Jiang, W., Bi, C., Zhang, R., Zhang, T., & Wen, X. (2016). How online usage of subscription-based journalism and mass communication research journal articles predicts citations. Learned Publishing, 29(3), 183–192.
Article Google Scholar
Haslam, N., Ban, L., Kaufmann, L., Loughnan, S., Peters, K., Whelan, J., & Wilson, S. (2008). What makes an article influential? Predicting impact in social and personality psychology. Scientometrics, 76(1), 169–185.
Article Google Scholar
Hirsch, J. E. (2007). Does the h index have predictive power? Proceedings of the National Academy of Sciences, 104(49), 19193–19198.
Article Google Scholar
Holm, A. N., Plank, B., Wright, D., & Augenstein, I. (2020). Longitudinal citation prediction using temporal graph neural networks. ArXiv Preprint ArXiv: 2012.05742.
Hou, J., Pan, H., Guo, T., Lee, I., Kong, X., & Xia, F. (2019). Prediction methods and applications in the science of science: A survey. Computer Science Review, 34, 100197.
Article Google Scholar
Hu, Y.-H., Tai, C.-T., Liu, K. E., & Cai, C.-F. (2020). Identification of highly-cited papers using topic-model-based and bibliometric features: The consideration of keyword popularity. Journal of Informetrics, 14(1), 101004.
Article Google Scholar
Ibáñez, A., Larrañaga, P., & Bielza, C. (2011). Predicting the h-index with cost-sensitive naive Bayes. 2011 11th International Conference on Intelligent Systems Design and Applications, 599–604.
Jensen, P., Rouquier, J.-B., & Croissant, Y. (2009). Testing bibliometric indicators by their prediction of scientists promotions. Scientometrics, 78(3), 467–479.
Article Google Scholar
Jiang, S., Koch, B., & Sun, Y. (2021). HINTS: Citation time series prediction for new publications via dynamic heterogeneous information network embedding. Proceedings of the Web Conference, 2021, 3158–3167.
Google Scholar
Kanellos, I., Vergoulis, T., Sacharidis, D., Dalamagas, T., & Vassiliou, Y. (2021). Ranking papers by their short-term scientific impact. 2021 IEEE 37th International Conference on Data Engineering (ICDE), 1997–2002.
Ke, Q., Ferrara, E., Radicchi, F., & Flammini, A. (2015). Defining and identifying sleeping beauties in science. Proceedings of the National Academy of Sciences, 112(24), 7426–7431.
Article Google Scholar
Klemiński, R., Kazienko, P., & Kajdanowicz, T. (2021). Where should I publish? Heterogeneous, networks-based prediction of paper’s citation success. Journal of Informetrics, 15(3), 101200.
Article Google Scholar
Klimek, P., Jovanovic, S., Egloff, A., & Schneider, R. (2016). Successful fish go with the flow: Citation impact prediction based on centrality measures for term–document networks. Scientometrics, 107(3), 1265–1282.
Article Google Scholar
Kong, X., Zhang, J., Zhang, D., Bu, Y., Ding, Y., & Xia, F. (2020). The gene of scientific success. ACM Transactions on Knowledge Discovery from Data (TKDD), 14(4), 1–19.
Article Google Scholar
Laurance, W. F., Useche, D. C., Laurance, S. G., & Bradshaw, C. J. (2013). Predicting publication success for biologists. BioScience, 63(10), 817–823.
Article Google Scholar
Lee, D. H. (2019). Predicting the research performance of early career scientists. Scientometrics, 121(3), 1481–1504.
Article Google Scholar
Levitt, J. M., & Thelwall, M. (2011). A combined bibliometric indicator to predict article impact. Information Processing & Management, 47(2), 300–308.
Article Google Scholar
Li, X.-L., Foo, C. S., Tew, K. L., & Ng, S.-K. (2009). Searching for rising stars in bibliography networks. International Conference on Database Systems for Advanced Applications, 288–292.
Li, C.-T., Lin, Y.-J., Yan, R., & Yeh, M.-Y. (2015). Trend-based citation count prediction for research articles. Pacific-Asia Conference on Knowledge Discovery and Data Mining, 659–671.
Li, L., & Tong, H. (2015). The child is father of the man: Foresee the success at the early stage. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 655–664.
Li, S., Zhao, W. X., Yin, E. J., & Wen, J.-R. (2019a). A neural citation count prediction model based on peer review text. Proceedings of the 2019a Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 4914–4924.
Li, W., Aste, T., Caccioli, F., & Livan, G. (2019b). Early coauthorship with top scientists predicts success in academic careers. Nature Communications, 10(1), 1–9.
Article Google Scholar
Lindahl, J. (2018). Predicting research excellence at the individual level: The importance of publication rate, top journal publications, and top 10% publications in the case of early career mathematicians. Journal of Informetrics, 12(2), 518–533.
Article Google Scholar
Lindahl, J., Colliander, C., & Danell, R. (2020). Early career performance and its correlation with gender and publication output during doctoral education. Scientometrics, 122(1), 309–330.
Article Google Scholar
Liu, L., Yu, D., Wang, D., & Fukumoto, F. (2020). Citation count prediction based on neural hawkes model. IEICE Transactions on Information and Systems, 103(11), 2379–2388.
Article Google Scholar
Livne, A., Adar, E., Teevan, J., & Dumais, S. (2013). Predicting citation counts using text and graph mining. Proc. the IConference 2013 Workshop on Computational Scientometrics: Theory and Applications, 1–4.
Luo, Z., He, J., Qian, J., Wang, Y., Chen, J., & Lu, W. (2020). Can scientific publication’s network structural features predict its citation? Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in, 2020, 485–486.
Google Scholar
Ma, A., Liu, Y., Xu, X., & Dong, T. (2021). A deep-learning based citation count prediction model with paper metadata semantic features. Scientometrics, 126(8), 6803–6823.
Article Google Scholar
Ma, Y., & Uzzi, B. (2018). Scientific prize network predicts who pushes the boundaries of science. Proceedings of the National Academy of Sciences, 115(50), 12608–12615.
Article Google Scholar
Mahalakshmi, G. S., Sendhilkumar, S., Jancy, P., & Easwarakumar, K. S. (2020). A Neural Learning Approach for Prediction of Research Citations Using Article Semantics. 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS), 816–819.
Mazloumian, A. (2012). Predicting scholars’ scientific impact. PLoS ONE, 7(11), e49246.
Article Google Scholar
Mistele, T., Price, T., & Hossenfelder, S. (2019). Predicting authors’ citation counts and h-indices with a neural network. Scientometrics, 120(1), 87–104.
Article Google Scholar
Nezhadbiglari, M., Gonçalves, M. A., & Almeida, J. M. (2016). Early prediction of scholar popularity. Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 181–190.
Nie, Y., Zhu, Y., Lin, Q., Zhang, S., Shi, P., & Niu, Z. (2019). Academic rising star prediction via scholar’s evaluation model and machine learning techniques. Scientometrics, 120(2), 461–476.
Article Google Scholar
Onodera, N., & Yoshikane, F. (2015). Factors affecting citation rates of research articles. Journal of the Association for Information Science and Technology, 66(4), 739–764.
Article Google Scholar
Panagopoulos, G., Tsatsaronis, G., & Varlamis, I. (2017). Detecting rising stars in dynamic collaborative networks. Journal of Informetrics, 11(1), 198–222.
Article Google Scholar
Park, H.-M., Sinshaw, Y. B., & Sohn, K.-A. (2017). Temporal citation network-based feature extraction for cited count prediction. International Conference on Mobile and Wireless Technology, 380–388.
Penner, O., Pan, R. K., Petersen, A. M., Kaski, K., & Fortunato, S. (2013). On the predictability of future impact in science. Scientific Reports, 3(1), 1–8.
Article Google Scholar
Pobiedina, N., & Ichise, R. (2016). Citation count prediction as a link prediction problem. Applied Intelligence, 44(2), 252–268.
Article Google Scholar
Põder, E. (2017). A framework for the measurement and prediction of an individual scientist’s performance. Trames, 21(1), 3–14.
Article Google Scholar
Porwal, P., & Devare, M. H. (2020). Citation Classification Prediction Implying Text Features Using Natural Language Processing and Supervised Machine Learning Algorithms. International Conference on Recent Trends in Image Processing and Pattern Recognition, 540–552.
Qian, Y., Dong, Y., Ma, Y., Jin, H., & Li, J. (2016). Feature engineering and ensemble modeling for paper acceptance rank prediction. ArXiv Preprint ArXiv: 1611.04369.
Rokach, L., Kalech, M., Blank, I., & Stern, R. (2011). Who is going to win the next association for the advancement of artificial intelligence fellowship award? Evaluating researchers by mining bibliographic data. Journal of the American Society for Information Science and Technology, 62(12), 2456–2470.
Article Google Scholar
Ruan, X., Zhu, Y., Li, J., & Cheng, Y. (2020). Predicting the citation counts of individual papers via a BP neural network. Journal of Informetrics, 14(3), 101039.
Article Google Scholar
Sandulescu, V., & Chiru, M. (2016). Predicting the future relevance of research institutions-The winning solution of the KDD Cup 2016. ArXiv Preprint ArXiv: 1609.02728.
Sayyadi, H., & Getoor, L. (2009). Futurerank: Ranking scientific articles by predicting their future pagerank. Proceedings of the 2009 SIAM International Conference on Data Mining, 533–544.
Schreiber, M. (2013). How relevant is the predictive power of the h-index? A case study of the time-dependent Hirsch index. Journal of Informetrics, 7(2), 325–329.
Article Google Scholar
Shen, H., Wang, D., Song, C., & Barabási, A.-L. (2014). Modeling and predicting popularity dynamics via reinforced poisson processes. Proceedings of the AAAI Conference on Artificial Intelligence. https://doi.org/10.1609/aaai.v28i1.8739
Article Google Scholar
Shuang, Q. (2016). Heterogenous Graph Mining for Measuring the Impact of Research Institutions.
Sinatra, R., Wang, D., Deville, P., Song, C., & Barabási, A.-L. (2016). Quantifying the evolution of individual scientific impact. Science, 354(6312), aaf5239.
Article Google Scholar
Singh, M., Patidar, V., Kumar, S., Chakraborty, T., Mukherjee, A., & Goyal, P. (2015). The role of citation context in predicting long-term citation profiles: An experimental study based on a massive bibliographic text dataset. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, 1271–1280.
Skarding, J., Gabrys, B., & Musial, K. (2021). Foundations and modeling of dynamic networks using dynamic graph neural networks: A survey. IEEE Access, 9, 79143–79168.
Article Google Scholar
Sohrabi, B., & Iraj, H. (2017). The effect of keyword repetition in abstract and keyword frequency per journal in predicting citation counts. Scientometrics, 110(1), 243–251.
Article Google Scholar
Stegehuis, C., Litvak, N., & Waltman, L. (2015). Predicting the long-term citation impact of recent publications. Journal of Informetrics, 9(3), 642–657.
Article Google Scholar
Stern, D. I. (2014). High-ranked social science journal articles can be identified from early citation information. PLoS ONE, 9(11), e112520. https://doi.org/10.1371/journal.pone.0112520
Article Google Scholar
Tahamtan, I., Safipour Afshar, A., & Ahamdzadeh, K. (2016). Factors affecting number of citations: A comprehensive review of the literature. Scientometrics, 107(3), 1195–1225.
Article Google Scholar
Thelwall, M., & Nevill, T. (2018). Could scientists use Altmetric: Com scores to predict longer term citation counts? Journal of Informetrics, 12(1), 237–248.
Article Google Scholar
Timilsina, M., Davis, B., Taylor, M., & Hayes, C. (2016). Towards predicting academic impact from mainstream news and weblogs: A heterogeneous graph based approach. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2016, 1388–1389.
Google Scholar
Valderrama, P., Escabias, M., Jiménez-Contreras, E., Valderrama, M. J., & Baca, P. (2018). A mixed longitudinal and cross-sectional model to forecast the journal impact factor in the field of Dentistry. Scientometrics, 116(2), 1203–1212.
Article Google Scholar
Van Dijk, D., Manor, O., & Carey, L. B. (2014). Publication metrics and success on the academic job market. Current Biology, 24(11), R516–R517.
Article Google Scholar
Van Noorden, R. (2017). The science That’s. Nature, 552, 162–164.
Article Google Scholar
Van Raan, A. F. (2004). Sleeping beauties in science. Scientometrics, 59(3), 467–472.
Article Google Scholar
Walker, D., Xie, H., Yan, K.-K., & Maslov, S. (2007). Ranking scientific publications using a model of network traffic. Journal of Statistical Mechanics: Theory and Experiment, 2007(06), P06010.
Article Google Scholar
Wan, X., & Liu, F. (2014). Are all literature citations equally important? Automatic citation strength estimation and its applications. Journal of the Association for Information Science and Technology, 65(9), 1929–1938.
Article Google Scholar
Wang, S., Xie, S., Zhang, X., Li, Z., Yu, P. S., & Shu, X. (2014). Future influence ranking of scientific literature. Proceedings of the 2014 SIAM International Conference on Data Mining, 749–757.
Wang, D., Song, C., & Barabási, A.-L. (2013). Quantifying long-term scientific impact. Science, 342(6154), 127–132.
Article Google Scholar
Wang, F., Fan, Y., Zeng, A., & Di, Z. (2019a). Can we predict ESI highly cited publications? Scientometrics, 118(1), 109–125.
Article Google Scholar
Wang, M., Jiao, S., Chai, K.-H., & Chen, G. (2019b). Building journal’s long-term impact: Using indicators detected from the sustained active articles. Scientometrics, 121(1), 261–283.
Article Google Scholar
Wang, M., Wang, Z., & Chen, G. (2019c). Which can better predict the future success of articles? Bibliometric indices or alternative metrics. Scientometrics, 119(3), 1575–1595.
Article Google Scholar
Wang, J., Zhang, F., Li, Y., & Liu, D. (2020). Attention-based multi-fusion method for citation prediction. In J.-S. Pan, J. Li, P.-W. Tsai, & L. C. Jain (Eds.), Advances in intelligent information hiding and multimedia signal processing (pp. 315–322). Springer.
Chapter Google Scholar
Wang, K., Shi, W., Bai, J., Zhao, X., & Zhang, L. (2021a). Prediction and application of article potential citations based on nonlinear citation-forecasting combined model. Scientometrics, 126(8), 6533–6550.
Article Google Scholar
Wang, W., Zhang, J., Zhou, F., Chen, P., & Wang, B. (2021b). Paper acceptance prediction at the institutional level based on the combination of individual and network features. Scientometrics, 126(2), 1581–1597.
Article Google Scholar
Wang, M., Yu, G., Xu, J., He, H., Yu, D., & An, S. (2012). Development a case-based classifier for predicting highly cited papers. Journal of Informetrics, 6(4), 586–599.
Article Google Scholar
Wang, M., Yu, G., & Yu, D. (2011). Mining typical features for highly cited papers. Scientometrics, 87(3), 695–706.
Article Google Scholar
Wang, S., Xie, S., Zhang, X., Li, Z., Yu, P. S., & He, Y. (2016). Coranking the future influence of multiobjects in bibliographic network through mutual reinforcement. ACM Transactions on Intelligent Systems and Technology (TIST), 7(4), 1–28.
Article Google Scholar
Way, S. F., Morgan, A. C., Clauset, A., & Larremore, D. B. (2017). The misleading narrative of the canonical faculty productivity trajectory. Proceedings of the National Academy of Sciences, 114(44), E9216–E9223.
Article Google Scholar
Way, S. F., Morgan, A. C., Larremore, D. B., & Clauset, A. (2019). Productivity, prominence, and the effects of academic environment. Proceedings of the National Academy of Sciences, 116(22), 10729–10733.
Article Google Scholar
Weihs, L., & Etzioni, O. (2017). Learning to predict citation-based impact measures. ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2017, 1–10.
Google Scholar
Weis, J. W., & Jacobson, J. M. (2021). Learning on knowledge graph dynamics provides an early warning of impactful research. Nature Biotechnology, 39(10), 1300–1307.
Article Google Scholar
Wen, J., Wu, L., & Chai, J. (2020). Paper citation count prediction based on recurrent neural network with gated recurrent unit. 2020 IEEE 10th International Conference on Electronics Information and Emergency Communication (ICEIEC), 303–306.
Wilson, J., Mohan, R., Arif, M., Chaudhury, S., & Lall, B. (2016). Ranking academic institutions on potential paper acceptance in upcoming conferences. ArXiv Preprint ArXiv: 1610.02828.
Wu, X., Fu, Q., & Rousseau, R. (2008). On indexing in the Web of Science and predicting journal impact factor. Journal of Zhejiang University Science B, 9(7), 582–590.
Article Google Scholar
Wu, Z., Lin, W., Liu, P., Chen, J., & Mao, L. (2019). Predicting long-term scientific impact based on multi-field feature extraction. IEEE Access, 7, 51759–51770.
Article Google Scholar
Xia, F., Wang, W., Bekele, T. M., & Liu, H. (2017). Big scholarly data: A survey. IEEE Transactions on Big Data, 3(1), 18–35.
Article Google Scholar
Xiao, C., Han, J., Fan, W., Wang, S., Huang, R., & Zhang, Y. (2019). Predicting scientific impact via heterogeneous academic network embedding. Pacific Rim International Conference on Artificial Intelligence, 555–568.
Xiao, C., Sun, L., Han, J., & Qiao, Y. (2021). Heterogeneous academic network embedding based multivariate random-walk model for predicting scientific impact. Applied Intelligence, 1–18.
Xie, Z. (2020). Predicting publication productivity for researchers: A piecewise Poisson model. Journal of Informetrics, 14(3), 101065.
Article Google Scholar
Xu, J., Li, M., Jiang, J., Ge, B., & Cai, M. (2019). Early prediction of scientific impact based on multi-bibliographic features and convolutional neural network. IEEE Access, 7, 92248–92258.
Article Google Scholar
Yan, R., Tang, J., Liu, X., Shan, D., & Li, X. (2011). Citation count prediction: Learning to estimate future citations for literature. Proceedings of the 20th ACM International Conference on Information and Knowledge Management, 1247–1252.
Yu, T., Yu, G., Li, P.-Y., & Wang, L. (2014). Citation impact prediction for scientific papers using stepwise regression analysis. Scientometrics, 101(2), 1233–1252.
Article Google Scholar
Yu, X., Szymanski, B. K., & Jia, T. (2021). Become a better you: Correlation between the change of research direction and the change of scientific performance. Journal of Informetrics, 15(3), 101193.
Article Google Scholar
Yuan, S., Tang, J., Zhang, Y., Wang, Y., & Xiao, T. (2018). Modeling and predicting citation count via recurrent neural network with long short-term memory. ArXiv Preprint ArXiv: 1811.02129.
Zhang, C., Liu, C., Yu, L., Zhang, Z.-K., & Zhou, T. (2017). Identifying the academic rising stars via pairwise citation increment ranking. Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data, 475–483.
Zhang, F., Bai, X., & Lee, I. (2019). Author impact: Evaluations, predictions, and challenges. IEEE Access, 7, 38657–38669.
Article Google Scholar
Zhang, F., & Wu, S. (2020). Predicting future influence of papers, researchers, and venues in a dynamic academic network. Journal of Informetrics, 14(2), 101035.
Article Google Scholar
Zhang, F., & Wu, S. (2021). Measuring academic entities’ impact by content-based citation analysis in a heterogeneous academic network. Scientometrics, 126(8), 7197–7222.
Article Google Scholar
Zhang, J., Ning, Z., Bai, X., Wang, W., Yu, S., & Xia, F. (2016a). Who are the rising stars in academia? IEEE/ACM Joint Conference on Digital Libraries (JCDL), 2016, 211–212.
Article Google Scholar
Zhang, J., Xia, F., Wang, W., Bai, X., Yu, S., Bekele, T. M., & Peng, Z. (2016b). Cocarank: A collaboration caliber-based method for finding academic rising stars. Proceedings of the 25th International Conference Companion on World Wide Web, 395–400.
Zhang, J., Xu, B., Liu, J., Tolba, A., Al-Makhadmeh, Z., & Xia, F. (2018a). PePSI: Personalized prediction of scholars’ impact in heterogeneous temporal academic networks. IEEE Access, 6, 55661–55672.
Article Google Scholar
Zhang, L., Xie, Y., Xidao, L., & Zhang, X. (2018b). Multi-source heterogeneous data fusion. International Conference on Artificial Intelligence and Big Data (ICAIBD), 2018, 47–51.
Google Scholar
Zhang, X., Fu, L., & Wang, X. (2018c). Ranking the Future Influence of Scientific Literatures. 2018c IEEE 4th International Conference on Computer and Communications (ICCC), 2362–2371.
Zhang, J., & Yu, P. S. (2018). Broad learning: An emerging area in social network analysis. ACM SIGKDD Explorations Newsletter, 20(1), 24–50.
Article Google Scholar
Zheng, Y. (2015). Methodologies for cross-domain data fusion: An overview. IEEE Transactions on Big Data, 1(1), 16–34.
Article Google Scholar
Zhou, F., Xu, X., Li, C., Trajcevski, G., Zhong, T., & Zhang, K. (2020a). A heterogeneous dynamical graph neural networks approach to quantify scientific impact. ArXiv Preprint ArXiv: 2003.12042.
Zhou, Y., Cheng, H., Li, Q., & Wang, W. (2020b). Diversity of temporal influence in popularity prediction of scientific publications. Scientometrics, 123(1), 383–392.
Article Google Scholar
Zhou, Y., Wang, R., Zeng, A., & Zhang, Y.-C. (2020c). Identifying prize-winning scientists by a competition-aware ranking. Journal of Informetrics, 14(3), 101038.
Article Google Scholar
Zhou, W., Gu, J., & Jia, Y. (2018). H-Index-based link prediction methods in citation network. Scientometrics, 117(1), 381–390.
Article Google Scholar
Zhou, Y., Li, Q., Yang, X., & Cheng, H. (2021). Predicting the popularity of scientific publications by an age-based diffusion model. Journal of Informetrics, 15(4), 101177.
Article Google Scholar
Zoller, D., Doerfel, S., Jäschke, R., Stumme, G., & Hotho, A. (2016). Posted, visited, exported: Altmetrics in the social tagging system BibSonomy. Journal of Informetrics, 10(3), 732–749.
Article Google Scholar
Zuo, Z., & Zhao, K. (2021). Understanding and predicting future research impact at different career stages—A social network perspective. Journal of the Association for Information Science and Technology, 72(4), 454–472.
Article Google Scholar

Download references

Funding

This research is partially supported by the National Natural Science Foundation of China (Grant Nos. 62202395 and 62176221), Sichuan Academic Achievement Analysis and Application Research Center (Grant No. XSCG2021-021), Natural Science Foundation of Sichuan Province (Grant No. 2022NSFSC0930), Fundamental Research Funds for the Central Universities (Grant No. 2682022CX067) and Youth Talent Startup Grant of SWJTU-Leeds Joint School.

Author information

Authors and Affiliations

Library, Southwest Jiaotong University, Chengdu, China
Wanjun Xia
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China
Wanjun Xia, Tianrui Li & Chongshou Li

Authors

Wanjun Xia
View author publications
You can also search for this author in PubMed Google Scholar
Tianrui Li
View author publications
You can also search for this author in PubMed Google Scholar
Chongshou Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chongshou Li.

Ethics declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Xia, W., Li, T. & Li, C. A review of scientific impact prediction: tasks, features and methods. Scientometrics 128, 543–585 (2023). https://doi.org/10.1007/s11192-022-04547-8

Download citation

Received: 10 March 2022
Accepted: 01 October 2022
Published: 26 November 2022
Issue Date: January 2023
DOI: https://doi.org/10.1007/s11192-022-04547-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A review of scientific impact prediction: tasks, features and methods

Abstract

Access this article

Similar content being viewed by others

Predicting the Evolution of Scientific Output

Predicting High Impact Academic Papers Using Citation Network Features

Research trend prediction in computer science publications: a deep neural network approach

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A review of scientific impact prediction: tasks, features and methods

Abstract

Access this article

Similar content being viewed by others

Predicting the Evolution of Scientific Output

Predicting High Impact Academic Papers Using Citation Network Features

Research trend prediction in computer science publications: a deep neural network approach

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation