Abstract
Advisor–advisee relationships in academic genealogy offer opportunities to understand the contribution of a mentor in shaping the research community. In this paper, we adapt the bibliometric g-index to study the mentorship role of a researcher. We call the new index \(g_m\)-index. It has some important differences from the mentorship h-index or the \(h_m\)-index. We compute the values of the \(h_m\) and \(g_m\) indices for researchers indexed in the Mathematics Genealogy Project and the Academic Family Tree. We observe for the majority of researchers, these index values are zero, but in non-zero cases, sometimes, the \(g_m\)-index can be significantly higher than the \(h_m\)-index. Moreover, the \(g_m\)-index decays less rapidly to zero than the \(h_m\)-index. It appears the \(g_m\)-index can be used to discriminate between researchers with the same \(h_m\)-index. We study how these mentorship indices are correlated with other indicators of academic success, and how they are correlated across generations of researchers.
Similar content being viewed by others
References
Alonso, S., Cabrerizo, F. J., Herrera-Viedma, E., & Herrera, F. (2009). h-index: A review focused in its variants, computation and standardization for different scientific fields. Journal of Informetrics, 3(4), 273–289.
Andraos, J. (2005). Scientific genealogies of physical and mechanistic organic chemists. Canadian Journal of Chemistry, 83(9), 1400–1414.
Bornmann, L., Mutz, R., & Daniel, H. D. (2008). Are there better indices for evaluation purposes than the h index? A comparison of nine different variants of the h index using data from biomedicine. Journal of the American Society for Information Science and Technology, 59(5), 830–837.
Cai, L., Tian, J., Liu, J., Bai, X., Lee, I., Kong, X., et al. (2019). Scholarly impact assessment: A survey of citation weighting solutions. Scientometrics, 118(2), 453–478.
Chariker, J. H., Zhang, Y., Pani, J. R., & Rouchka, E. C. (2017). Identification of successful mentoring communities using network-based analysis of mentor–mentee relationships across nobel laureates. Scientometrics, 111(3), 1733–1749.
Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2009). Introduction to algorithms. Cambridge: MIT Press.
Costas, R., & Bordons, M. (2007). The h-index: Advantages, limitations and its relation with other bibliometric indicators at the micro level. Journal of Informetrics, 1(3), 193–203.
Costas, R., & Bordons, M. (2008). Is g-index better than h-index? An exploratory study at the individual level. Scientometrics, 77(2), 267–288.
Costas, R., & Franssen, T. (2018). Reflections around ‘the cautionary use’ of the h-index: Response to Teixeira da Silva and Dobránszki. Scientometrics, 115(2), 1125–1130.
Da Silva, J. A. T., & Dobránszki, J. (2018). Multiple versions of the h-index: Cautionary use for formal academic purposes. Scientometrics, 115(2), 1107–1113.
David, S. V. (January 3, 2019). Private communication (e-mail).
David, S. V., & Hayden, B. Y. (2012). Neurotree: A collaborative, graphical database of the academic genealogy of neuroscience. PloS ONE, 7(10), e46608.
Diehl, P. F. (March 15, 2018). Turning good teaching on its head: Part I. Retrieved June 19, 2019 from https://www.insidehighered.com/advice/2018/03/15/learning-opposite-good-teaching-opinion.
Dores, W., Benevenuto, F., & Laender, A. H. (2016). Extracting academic genealogy trees from the networked digital library of theses and dissertations. In Proceedings of the 16th ACM/IEEE-CS joint conference on digital libraries (pp. 163–166). ACM
Dores, W., Soares, E., Benevenuto, F., & Laender, A. H. (2017). Building the Brazilian academic genealogy tree. In Proceedings of the international conference on theory and practice of digital libraries (pp. 537–543). Springer.
Egghe, L. (2006). Theory and practise of the g-index. Scientometrics, 69(1), 131–152.
Ferreira, A. A., Gonçalves, M. A., & Laender, A. H. (2012). A brief survey of automatic methods for author name disambiguation. ACM SIGMOD Record, 41(2), 15–26.
Gargiulo, F., Caen, A., Lambiotte, R., & Carletti, T. (2016). The classical origin of modern mathematics. EPJ Data Science, 5(1), 26.
Hart, R. E., & Cossuth, J. H. (2013). A family tree of tropical meteorology’s academic community and its proposed expansion. Bulletin of the American Meteorological Society, 94(12), 1837–1848.
Hirsch, J. E. (2005). An index to quantify an individual’s scientific research output. Proceedings of the National Academy of Sciences, 102(46), 16569–16572.
Hirshman, B. R., Tang, J. A., Jones, L. A., Proudfoot, J. A., Carley, K. M., Marshall, L., et al. (2016). Impact of medical academic genealogy on publication patterns: an analysis of the literature for surgical resection in brain tumor patients. Annals of Neurology, 79(2), 169–177.
Jackson, A. (2007). A labor of love: The mathematics genealogy project. Notices of the American Mathematical Society, 54(8), 1002–1003.
James, J. M., Rayner, A., & Bruno, J. (2015). Are you my mentor? New perspectives and research on informal mentorship. The Journal of Academic Librarianship, 41(5), 532–539.
Kelley, E. A., & Sussman, R. W. (2007). An academic genealogy on the history of American field primatologists. American Journal of Physical Anthropology, 132(3), 406–425.
Liu, J., Xia, F., Wang, L., Xu, B., Kong, X., Tong, H., et al. (2019). Shifu2: A network representation learning based model for advisor–advisee relationship mining. IEEE Transactions on Knowledge and Data Engineering,. https://doi.org/10.1109/TKDE.2019.2946825.
Malmgren, R. D., Ottino, J. M., & Amaral, L. A. N. (2010). The role of mentorship in protégé performance. Nature, 465(7298), 622–626.
Marsh, E. J. (2017). Family matters: Measuring impact through one’s academic descendants. Perspectives on Psychological Science, 12(6), 1130–1132.
Paglis, L. L., Green, S. G., & Bauer, T. N. (2006). Does adviser mentoring add value? A longitudinal study of mentoring and doctoral student outcomes. Research in Higher Education, 47(4), 451–476.
Rossi, L., Damaceno, R. J., Freire, I. L., Bechara, E. J., & Mena-Chalco, J. P. (2018). Topological metrics in academic genealogy graphs. Journal of Informetrics, 12(4), 1042–1058.
Rossi, L., Freire, I. L., & Mena-Chalco, J. P. (2017). Genealogical index: A metric to analyze advisor–advisee relationships. Journal of Informetrics, 11(2), 564–582.
Russell, T. G., & Sugimoto, C. R. (2009). MPACT family trees: Quantifying academic genealogy in library and information science. Journal of Education for Library and Information Science, 50(4), 248–262.
Sugimoto, C. R. (2012). Are you my mentor? Identifying mentors and their roles in lis doctoral education. Journal of Education for Library and Information Science, 53(1), 2–19.
Sugimoto, C. R., Ni, C., Russell, T. G., & Bychowski, B. (2011). Academic genealogy as an indicator of interdisciplinarity: An examination of dissertation networks in library and information science. Journal of the Association for Information Science and Technology, 62(9), 1808–1828.
Tuesta, E. F., Delgado, K. V., Mugnaini, R., Digiampietri, L. A., Mena-Chalco, J. P., & Pérez-Alcázar, J. J. (2015). Analysis of an advisor–advisee relationship: An exploratory study of the area of exact and earth sciences in Brazil. PloS ONE, 10(5), e0129065.
Wang, C., Han, J., Jia, Y., Tang, J., Zhang, D., Yu, Y., & Guo, J. (2010). Mining advisor–advisee relationships from research publication networks. In Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining (pp. 203–212). ACM.
Zhao, Z., Liu, W., Qian, Y., Nie, L., Yin, Y., & Zhang, Y. (2018). Identifying advisor–advisee relationships from co-author networks via a novel deep model. Information Sciences, 466, 258–269.
Acknowledgements
This work is supported by the National Digital Library of India Project sponsored by the Ministry of Human Resource Development, Government of India at IIT Kharagpur. We thank Luciano Rossi for kindly sharing the MGP dataset studied in Rossi et al. (2017) and Stephen V. David for kindly sharing a snapshot of the AFT dataset (David 2019). We express our heartfelt gratitude to the anonymous reviewers for their very insightful and constructive feedback to improve the manuscript.
Funding
This work was supported by Ministry of Human Resource Development (IN) (Grant No. F.No.16-7/2017-TEL).
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: Distribution of \(h_m\) and \(g_m\) indices in MGP
Here we analyze the general nature of the plot in Fig. 6 and its underlying reasons. By Corollary 1 in "Some properties of the gm-index" section, the number of researchers with \(h_m=1\) is always same as or greater than the number of researchers with \(g_m=1\). In the present case, there are \(6.5\%\) more researchers with \(h_m=1\) than with \(g_m=1\). In the \(g_m\)-index space, these researchers are distributed among the bins for higher \(g_m\)-index. Indeed, for index values other than one in Fig. 6, the frequency of researchers at a given \(g_m\)-index is always observed to be greater than that at the same \(h_m\)-index (unless both are zero). Let us explore this in greater detail. Denote the number of researchers having \(h_m=\beta\) as \(M(\beta )\) and the number of researchers having \(g_m=\beta\) as \(N(\beta )\). For \(\beta = 0\), \(M(\beta ) = N(\beta )\) by Theorem 2. For \(\beta = 1\), \(M(\beta ) \ge N(\beta )\) by Corollary 1. For \(\beta >12\), \(M(\beta ) = 0\) (from Fig. 6), therefore, \(N(\beta ) > M(\beta )\) trivially wherever \(N(\beta ) \ne 0\). Therefore, let us focus on \(2 \le \beta \le 12\). Observe that around \(40\%\) of researchers with \(h_m=2\) and around \(17\%\) of researchers with \(h_m=1\) possess \(g_m=2\). The frequency of researchers at a given \(h_m\)-index falls steeply with \(h_m\)-index. In particular, the frequency of researchers with \(h_m=1\) is almost four times the number of researchers with \(h_m=2\), i.e., \(M(1) \approx 4M(2)\). Thus, \(N(2) \ge 0.4M(2) + 0.17M(1) = 1.1 M(2)\). In other words, \(N(2) > M(2)\). Similarly, when \(\beta =3\), \(N(3) >0.18M(3) + 0.36M(2)\). When \(\beta =4\), \(N(4) > 0.33M(3) + 0.16M(2)\). Since \(M(2) \approx 3M(3)\) and \(M(3) \approx 2M(4)\), we have \(N(3) >M(3)\) and \(N(4) > M(4)\). Thus, for \(2 \le \beta \le 4\), \(N(\beta )> M(\beta )\). To infer the ordering between \(N(\beta )\) and \(M(\beta )\) for \(5 \le \beta \le 12\), it is sufficient to consider only the most frequent \(h_m\)-index among the researchers with \(g_m=\beta\). Let us call the most frequently occurring \(h_m\)-index among researchers with \(g_m=\beta\) as the largest \(h_m\)-contributor to \(g_m=\beta\). (There can be more than one largest \(h_m\)-contributor to a given \(g_m\)-index.) As expected, the largest \(h_m\)-contributor typically has \(h_m < \beta\). For example, the largest \(h_m\)-contributor to \(g_m=5\) is \(h_m=3\). Similarly, the largest \(h_m\)-contributor to \(g_m=8\) is \(h_m=5\). In particular, there are 747 researchers with \(g_m=5\) out of which 367 have \(h_m=3\), and there are 196 researchers with \(g_m=8\) among whom 87 have \(h_m=5\), and in both these cases, no other \(h_m\)-index is more common in the corresponding \(g_m\)-index set. If the largest \(h_m\)-contributor (in \(g_m=\beta\)) has \(h_m=\beta -\delta\), and its contribution is a fraction \(0 < q \le 1\) of its own population size \(M(\beta -\delta )\), then clearly, \(N(\beta ) > q M(\beta - \delta )\). As the size \(M(\beta )\) falls fast with \(\beta\) (e.g., \(M(5) \approx M(3)/4\) and \(M(7) \approx M(5)/8\)), it turns out, for \(5 \le \beta \le 12\), that \(N(\beta ) > M(\beta )\) even if this largest contributor contributes only a small fraction of its own population size. For example, for \(g_m=5\), the largest contributor is \(h_m=3\) and it contributes \(q = 0.3\) of its frequency M(3). Hence, the number of researchers with \(g_m=5\) is \(N(5)> q M (3) \approx 0.3 \times 4 M(5) = 1.2 M(5) > M(5)\). Due to this behavior, we find \(N(\beta ) > M(\beta )\) for \(5 \le \beta \le 12\). The above discussion is a quantitative analysis of the variation in the frequency of researchers with various \(h_m\)-index and \(g_m\)-index values, and how researchers with a certain \(h_m\)-index get distributed among various \(g_m\)-indices.
Appendix 2: Mathematicians with high mentorship index
Tables 2 and 3 show mathematicians who graduated in the twentieth and nineteenth centuries respectively, in order of their \(g_m\)-index. We restrict to mathematicians whose \(h_m\)-index is at least 10. The data are taken from the MGP snapshot in Rossi et al. (2017). The 1st column of each table mentions the identifier of the researcher in MGP, the 2nd column mentions the name of the researcher, the 3rd column mentions the researcher’s \(g_m\)-index, and the 4th column mentions the researcher’s \(h_m\)-index. The mentorship fecundity is shown in the 5th column and the total number of descendants in the 6th column. The year in which the researcher was awarded the doctoral degree is shown in column 7, while column 8 mentions the academic institution that awarded the degree. The 9th column, Active mentoring life, mentions the temporal span bounded by the years in which the first and the last PhDs were granted under the researcher’s supervision. The last column mentions the top-3 most prolific DDs of the researcher, i.e., the protégés who produced the highest number of PhDs. It is written in the form Name (id in MGP, number of DDs). For example, in Table 2, the most prolific DD of Jacques-Louis Lions is Roger Temam, whose MGP id is 11498 and who has 118 DDs. A very high \(g_m\)-index indicates the presence of very prolific DDs among the protégés of a researcher. This can help distinguish mentors with the same \(h_m\)-index. The highest \(g_m\)-index and \(h_m\)-index in the twentieth century are 23 and 12, respectively. They are 17 and 11, respectively in the nineteenth century. In the MGP data for pre-nineteenth century, the highest \(g_m\)-index is 5 and the highest \(h_m\)-index is 2.
Rights and permissions
About this article
Cite this article
Sanyal, D.K., Dey, S. & Das, P.P. gm-index: a new mentorship index for researchers. Scientometrics 123, 71–102 (2020). https://doi.org/10.1007/s11192-020-03384-x
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-020-03384-x