Data-driven artificial intelligence to automate researcher assessment

Weber, Rosina O.; Duarte, Kedma B.

doi:10.1007/s11192-020-03859-x

Data-driven artificial intelligence to automate researcher assessment

Published: 05 February 2021

Volume 126, pages 3265–3281, (2021)
Cite this article

Scientometrics Aims and scope Submit manuscript

564 Accesses
6 Altmetric
Explore all metrics

Abstract

This article describes how to utilize data-driven artificial intelligence (AI) to automate researcher assessment using data from profiling systems. We consider that a researcher assessment is done for a purpose and not divorced from a specific target placement. We formulate researcher assessment as a binary classification task, that is, a candidate researcher is classified as either fit or unfit for a given placement. For classifying researchers, we adopt case-based reasoning, a transparent artificial intelligence methodology that implements analogical reasoning, allows adaptation, machine learning, and explainability. This work addresses a human limitation through AI. Given a small number of candidates for a job or award and a clear job description, even if capable of selecting the best fit candidate, human decisions may be neither transparent nor reproducible. The approach in this article describes how to use AI methods to, from a job description, select the best fit candidate while considering career trajectories, providing explanations, and being reproducible. We describe the implementation of the methodology for a hypothetical placement in a real research institute from real but anonymized curriculum vitae from the Brazilian Lattes Database. We describe an experiment demonstrating that the purpose-oriented approach is more accurate than purpose-independent classifiers. The proposed methodology meets various principles from the Leiden Manifesto.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The rise of artificial intelligence – understanding the AI identity threat at the workplace

Article Open access 05 October 2021

Literature reviews as independent studies: guidelines for academic practice

Article Open access 14 October 2022

Is artificial intelligence improving the audit process?

Article Open access 07 July 2022

Availability of data and material

Data is not made available because it is under IP. It was obtained through a partnership for this work.

Code availability

Authors are not making code available because the implementation is not generic enough for reuse.

References

Aamodt, A., & Plaza, E. (1994). Case-based reasoning: foundational issues, methodological variations, and system approaches. AI communications. IOS Press, 7(1), 39–59.
Arrieta, A. B., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., et al. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58, 82–115.
Article Google Scholar
Bennett, B. M., & Underwood, R. E. (1970). 283. Note: On McNemar’s test for the 2 × 2 table and its power function. Biometrics, 26(2), 339–343. https://doi.org/10.2307/2529083.
Byrne, R. M. J. (2019). Counterfactuals in explainable artificial intelligence (XAI): Evidence from human reasoning. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19) (pp. 6276–6282).
CNPq (2020). National Council for Scientific and Technological Development. Plataforma Lattes. Accessed 28 Nov 2020 http://lattes.cnpq.br
DGP (2020). Directory of research groups in Brazil. Microcephaly Epidemic Research Group. Accessed 28 Nov 2020 http://dgp.cnpq.br/dgp/espelhogrupo/2723404431935999
Doyle, D., Cunningham, P., & Walsh, P. (2006). An evaluation of the usefulness of explanation in a CBR system for decision support in bronchiolitis treatment. Computational Intelligence Journal, 22(3–4), 269–281.
Article Google Scholar
Duarte, K. B., Weber, R. O., & Pacheco, R. C. S. (2016). Case-cased comparison of career trajectories. In ICCBR 2016 Workshops (Vol. 1815, pp. 152–161). Atlanta, GA, USA.
Duarte, K. B. (2017). Assessing researcher quality for collaborative purposes. Doctoral dissertation, Federal University of Santa Catarina (UFSC), Florianópolis, Brazil. Retrieved from http://btd.egc.ufsc.br/?p=2495
Microcephaly Epidemic Research Group. (2016). Microcephaly in infants, Pernambuco state, Brazil, 2015. Emerging infectious Diseases, 22(6), 1090–1093.
Article Google Scholar
Hall, M. A. (1999). Correlation-based Feature Selection for Machine Learning. Doctoral dissertation, University of Waikato, Hamilton, NewZealand. https://doi.org/10.1.1.37.4643
Hicks, D., Wouters, P., Waltman, L., Rijcke, S., & Rafols, I. (2015). The Leiden manifesto for research metrics. Nature, 520(7548), 429–431.
Article Google Scholar
Johs, A., Lutts, M., & Weber, R. (2018). Measuring explanation quality in XCBR. In Proceedings of ICCBR 2018 (pp. 75–78).
Kenny, E. M., & Keane, M. T. (2019). Twin-systems to explain artificial neural networks using case-based reasoning: Comparative tests of feature-weighting methods in ANN-CBR twins for XAI. In Proceedings of the Twenty-Eighth International Joint Conferences on Artificial Intelligence (IJCAI), (pp. 2708–2715).
Kim, B., Rudin, C., & Shah, J. (2014). The Bayesian case model: A generative approach for case-based reasoning and prototype classification. In Advances in neural information processing systems (pp. 1952–1960). http://www.arXiv:1503.01161v1. [stat.ML]. https://proceedings.neurips.cc/paper/2014/file/390e982518a50e280d8e2b535462ec1f-Paper.pdf.
Kohavi, R., & John, G. H. (1997). Wrappers for feature subset selection. Artificial Intelligence, 97(1997), 273–324.
Article Google Scholar
Leake, D. (1992). Evaluating Explanations: A Content Theory. Hillsdale, NJ: Lawrence Erlbaum.
Google Scholar
Liu, B., Dai, Y., Li, X., Lee, W. S., & Yu, P. S. (2003). Building Text Classifiers Using Positive and Unlabeled Examples. In Proceedings of the Third IEEE International Conference on Data Mining (p. 179). USA: IEEE Computer Society.
Mitchell, R., Michalski, J., & Carbonell, T. (2013). An artificial intelligence approach. Berlin: Springer.
Google Scholar
Moor, J. (2006). The Dartmouth College artificial intelligence conference: The next fifty years. AI Magazine, 27(4), 87–87. https://doi.org/10.1609/aimag.v27i4.1911.
Article Google Scholar
Parliament, E. (2016). Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation).
Richter, M. M., & Weber, R. O. (2013). Case-based reasoning: A textbook. Berlin: Springer-Verlag.
Book Google Scholar
Sørmo, F., Cassens, J., & Aamodt, A. (2005). Explanation in case-based reasoning–perspectives and goals. Artificial Intelligence Review, 24(2), 109–143.
Article Google Scholar
Wettschereck, D., Aha, D. W., & Mohri, T. (1997). A review and empirical evaluation of feature weighting methods for a class of lazy learning algorithms. Artificial Intelligence Review, 11, 273–314. https://doi.org/10.1023/A:1006593614256.
Article Google Scholar

Download references

Acknowledgements

The authors thank and acknowledge Professor Roberto C. S. Pacheco for his contributions to earlier versions of this work and for the use of anonymized data from the Lattes Database through the agreement between the Stela Institute and the Goiás State Research Support Foundation (FAPEG), Brazil, where Kedma B. Duarte held a position at the time of this research.

Funding

The authors were not funded during the period this research was conducted.

Author information

Authors and Affiliations

Information Science, Drexel University, Philadelphia, PA, USA
Rosina O. Weber
Information System, Goias State University, Anápolis, GO, Brazil
Kedma B. Duarte

Authors

Rosina O. Weber
View author publications
You can also search for this author in PubMed Google Scholar
Kedma B. Duarte
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Both authors contributed equally. The first author was primarily responsible for the conception of the article and the second author primarily responsible for conducting experiments.

Corresponding author

Correspondence to Rosina O. Weber.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Weber, R.O., Duarte, K.B. Data-driven artificial intelligence to automate researcher assessment. Scientometrics 126, 3265–3281 (2021). https://doi.org/10.1007/s11192-020-03859-x

Download citation

Received: 04 August 2020
Accepted: 27 December 2020
Published: 05 February 2021
Issue Date: April 2021
DOI: https://doi.org/10.1007/s11192-020-03859-x

Keywords

Mathematical Subject Classification

JEL Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Data-driven artificial intelligence to automate researcher assessment

Abstract

Access this article

Similar content being viewed by others

The rise of artificial intelligence – understanding the AI identity threat at the workplace

Literature reviews as independent studies: guidelines for academic practice

Is artificial intelligence improving the audit process?

Availability of data and material

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Mathematical Subject Classification

JEL Classification

Navigation

Data-driven artificial intelligence to automate researcher assessment

Abstract

Access this article

Similar content being viewed by others

The rise of artificial intelligence – understanding the AI identity threat at the workplace

Literature reviews as independent studies: guidelines for academic practice

Is artificial intelligence improving the audit process?

Availability of data and material

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematical Subject Classification

JEL Classification

Search

Navigation