Machine Learning Based Approach for Future Prediction of Authors in Research Academics

Bhattacharya, Sovan; Banerjee, Ayan; Goswami, Abhik; Nandi, Subrata; Pradhan, Dinesh Kumar

doi:10.1007/s42979-023-01692-6

Machine Learning Based Approach for Future Prediction of Authors in Research Academics

Original Research
Published: 01 April 2023

Volume 4, article number 306, (2023)
Cite this article

SN Computer Science Aims and scope Submit manuscript

Sovan Bhattacharya ORCID: orcid.org/0000-0002-0586-0160^1,2^na1,
Ayan Banerjee¹^na1,
Abhik Goswami¹^na1,
Subrata Nandi²^na1 &
…
Dinesh Kumar Pradhan¹^na1

140 Accesses
1 Citation
Explore all metrics

Abstract

In the present time, it is important to predict superior authors from the huge research community, which determines their performance with their future prospects and opportunities, in the field of scientific research. There is a need to solve the authors’ future prediction problem for equally assessing the performance of the researchers that helps to improve research quality highly, influence other types of research, that will also help to identify the research carrier and other research parameters, that will further influence budding researchers. In our proposed model, to solve the authors’ future prediction problems, five machines learning models i.e., SVM, Logistic Regression, Naive Bayes, Decision Tree, and Random Forest are implemented on a data set of 2750 authors where 1000 authors are the ranked authors. So, we have collected 2750 authors’ data from their bibliographic data sets. Then used feature engineering and feature scaling to build the desired model and prepare the final data set used to build the model. There after varied data sizes and training ratios against the five ML models and measured all the evaluation metrics in different cases. We have seen that the Decision Tree and Random Forest model outperform the other models with an accuracy score of 1.0 and the other scores also performed well.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Feature Selection Using Extra Trees Classifier for Research Productivity Framework in Indonesia

A Data Mining Approach for Predicting Academic Success – A Case Study

Machine Learning Applied to the H Index of Colombian Authors with Publications in Scopus

Notes

References

Ding Y, Yan E, Frazho A, Caverlee J. Pagerank for ranking authors in co-citation networks. J Am Soc Inf Sci Technol. 2009;60(11):2229–43.
Article Google Scholar
Zhao P, Han J, Sun Y. P-rank: a comprehensive structural similarity measure over information networks. In: Proceedings of the 18th ACM conference on information and knowledge management. 2009; p. 553–562.
Gollapalli SD, Mitra P, Giles CL. Ranking authors in digital libraries. In: Proceedings of the 11th annual international ACM/IEEE joint conference on digital libraries. 2011; p. 251–254.
Senanayake U, Piraveenan M, Zomaya AY. The p-index: ranking scientists using network dynamics. Procedia Comput Sci. 2014;29:465–77.
Article Google Scholar
Liu Z, Huang H, Wei X, Mao X. Tri-rank: an authority ranking framework in heterogeneous academic networks by mutual reinforce. In: 2014 IEEE 26th international conference on tools with artificial intelligence. IEEE; 2014. p. 493–500.
Pradhan D, Paul PS, Maheswari U, Nandi S, Chakraborty T. C3-index: revisiting author’s performance measure. In: Proceedings of the 8th ACM conference on web science. 2016; p. 318–319.
Franceschet M, Colavizza G. Timerank: a dynamic approach to rate scholars using citations. J Informetr. 2017;11(4):1128–41.
Article Google Scholar
Zhang J, Ning Z, Bai X, Kong X, Zhou J, Xia F. Exploring time factors in measuring the scientific impact of scholars. Scientometrics. 2017;112(3):1301–21.
Article Google Scholar
Zhang J, Hu Y, Ning Z, Tolba A, Elashkar E, Xia F. AIRank: author impact ranking through positions in collaboration networks. Complexity. 2018;2018:4697485.
Google Scholar
Zhang C, Yu L, Zhang X, Chawla NV. Task-guided and semantic-aware ranking for academic author-paper correlation inference. International Joint Conferences on Artificial Intelligence. 2018.
Bornmann L, Leydesdorff L, Wang J. How to improve the prediction based on citation impact percentiles for years shortly after the publication date? J Informetr. 2014;8(1):175–80.
Article Google Scholar
Varlamis PGTG. I detecting rising stars in dynamic collaborative networks. J Informetr. 2017;11(1):198.
Article Google Scholar
Daud A, Aljohani NR, Abbasi RA, Rafique Z, Amjad T, Dawood H, Alyoubi KH. Finding rising stars in co-author networks via weighted mutual influence. In: Proceedings of the 26th international conference on world wide web companion. 2017; p. 33–41.
Amjad T, Daud A, Aljohani NR. Ranking authors in academic social networks: a survey. Library Hi Tech. 2018.
Zhu L, Zhu D, Wang X, Cunningham SW, Wang Z. An integrated solution for detecting rising technology stars in co-inventor networks. Scientometrics. 2019;121(1):137–72.
Article Google Scholar
Zhang F, Bai X, Lee I. Author impact: evaluations, predictions, and challenges. IEEE Access. 2019;7:38657–69.
Article Google Scholar
Wang M, Zhang J, Jiao S, Zhang T. Evaluating the impact of citations of articles based on knowledge flow patterns hidden in the citations. PLoS One. 2019;14(11):0225276.
Article Google Scholar
Daud A, Song M, Hayat MK, Amjad T, Abbasi RA, Dawood H, Ghani A. Finding rising stars in bibliometric networks. Scientometrics. 2020;124(1):633–61.
Article Google Scholar
Amjad T, Munir J. Investigating the impact of collaboration with authority authors: a case study of bibliographic data in field of philosophy. Scientometrics. 2021;126(5):4333–53.
Article Google Scholar
Xiao S, Yan J, Li C, Jin B, Wang X, Yang X, Chu SM, Zha H. On modeling and predicting individual paper citation count over time. In: Ijcai. 2016; p. 2676–2682.
Ashraf S, Iqbal HR, Nawab RMA. Cross-genre author profile prediction using stylometry-based approach. In: CLEF (Working Notes). Citeseer; 2016. p. 992–999.
Reddy TR, Vardhan BV, Reddy PV. Author profile prediction using pivoted unique term normalization. Indian J Sci Technol. 2016; 9(46). https://doi.org/10.17485/ijst/2016/v9i46/99404.
Nie Y, Zhu Y, Lin Q, Zhang S, Shi P, Niu Z. Academic rising star prediction via scholar’s evaluation model and machine learning techniques. Scientometrics. 2019;120(2):461–76.
Article Google Scholar
Bin-Obaidellah O, Al-Fagih AE. Scientometric indicators and machine learning-based models for predicting rising stars in academia. In: 2019 7th international conference on smart computing & communications (ICSCC). IEEE; 2019. p. 1–7.
Bütün E, Kaya M. Predicting citation count of scientists as a link prediction problem. IEEE Trans Cybern. 2019;50(10):4518–29.
Article Google Scholar
Shoaib M, Daud A, Amjad T. Author name disambiguation in bibliographic databases: a survey. arXiv preprint arXiv:2004.06391. 2020.
Su Z. Prediction of future citation count with machine learning and neural network. In: 2020 Asia-Pacific conference on image processing, electronics and computers (IPEC). IEEE; 2020. p. 101–104.
Fujita M, Inoue H, Terano T. Analyzing promising researchers using network centralities of co-authorship networks from academic literature. New Gener Comput. 2021;39(1):181–97.
Article Google Scholar
Khan ZY, Niu Z, Sandiwarno S, Prince R. Deep learning techniques for rating prediction: a survey of the state-of-the-art. Artif Intell Rev. 2021;54(1):95–135.
Article Google Scholar
Daud A, ul Islam N, Li X, Razzak I, Hayat MK. Identifying rising stars via supervised machine learning. IEEE Trans Comput Soc Syst. 2022. https://doi.org/10.1109/TCSS.2022.3178070.
Ávila-Toscano JH, Romero-Pérez I, Saavedra-Guajardo E, Marenco-Escuderos A. Determinants of colombian scientific production in social sciences articles indexed in wos, scopus and other databases: tree of classification and regression. Rev Interam Bibliotecol. 2022;45(1). https://doi.org/10.17533/udea.rib.v45n1e339712.
Bhattacharya S. Discoveries of research genealogy from large-scale academic dataset: issues, challenges and application. Int J Comput Sci Eng. 2019;07:262–7.
Google Scholar
Wang X, van Harmelen F, Huang Z. Recommending scientific datasets using author networks in ensemble methods. 2022.
Nawaz A, Malik M. Rising stars prediction in reviewer network. Electron Commer Res. 2022;22(1):53–75.
Article Google Scholar
Hirsch JE. An index to quantify an individual’s scientific research output. Proc Natl Acad Sci. 2005;102(46):16569–72.
Article MATH Google Scholar

Download references

Funding

We do not have any funding for this article.

Author information

Sovan Bhattacharya, Ayan Banerjee, Abhik Goswami, Subrata Nandi and Dinesh Kumar Pradhan contributed equally to this work.

Authors and Affiliations

Department of CSE, Dr. B. C. Roy Engineering College, Jemua Road, Durgapur, West Bengal, 713206, India
Sovan Bhattacharya, Ayan Banerjee, Abhik Goswami & Dinesh Kumar Pradhan
Department of CSE, National Institute of Technology, Mahatma Gandhi Road, Durgapur, West Bengal, 713209, India
Sovan Bhattacharya & Subrata Nandi

Authors

Sovan Bhattacharya
View author publications
You can also search for this author in PubMed Google Scholar
Ayan Banerjee
View author publications
You can also search for this author in PubMed Google Scholar
Abhik Goswami
View author publications
You can also search for this author in PubMed Google Scholar
Subrata Nandi
View author publications
You can also search for this author in PubMed Google Scholar
Dinesh Kumar Pradhan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors have equal contributions to the article.

Corresponding author

Correspondence to Sovan Bhattacharya.

Ethics declarations

Conflict of interest

We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Ethics approval

The Author confirms that the work described has not been published before, and it is not under consideration for publication elsewhere.

Consent to participate

The Authors agree to participate in the conference.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Enabling Innovative Computational Intelligence Technologies for IOT” guest edited by Omer Rana, Rajiv Misra, Alexander Pfeiffer, Luigi Troiano and Nishtha Kesswani.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Bhattacharya, S., Banerjee, A., Goswami, A. et al. Machine Learning Based Approach for Future Prediction of Authors in Research Academics. SN COMPUT. SCI. 4, 306 (2023). https://doi.org/10.1007/s42979-023-01692-6

Download citation

Received: 30 September 2022
Accepted: 09 January 2023
Published: 01 April 2023
DOI: https://doi.org/10.1007/s42979-023-01692-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Machine Learning Based Approach for Future Prediction of Authors in Research Academics

Abstract

Access this article

Similar content being viewed by others

Feature Selection Using Extra Trees Classifier for Research Productivity Framework in Indonesia

A Data Mining Approach for Predicting Academic Success – A Case Study

Machine Learning Applied to the H Index of Colombian Authors with Publications in Scopus

Notes

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval

Consent to participate

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Machine Learning Based Approach for Future Prediction of Authors in Research Academics

Abstract

Access this article

Similar content being viewed by others

Feature Selection Using Extra Trees Classifier for Research Productivity Framework in Indonesia

A Data Mining Approach for Predicting Academic Success – A Case Study

Machine Learning Applied to the H Index of Colombian Authors with Publications in Scopus

Notes

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval

Consent to participate

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation