Abstract
The model proposed by Burrell (Information Processing and Management 28:637–645, 1992, Journal of Informetrics 1:16–25, 2007a) to describe the way that an individual author’s publication/citation career develops in time is investigated further, the aim being to describe in more detail the form of the citation distribution and the way it evolves over time. Both relative and actual frequency distributions are considered. Theoretical aspects are developed analytically and graphically and then illustrated using small empirical data sets relating to some well-known informetrics scholars. Perhaps surprisingly, it is found that the distribution may well be approximated in some cases by a simple geometric distribution.
Similar content being viewed by others
References
Ahlgren, P., Jarneving, B., & Rousseau, R. (2003). Requirements for a cocitation similarity measure, with special reference to Pearson’s correlation coefficient. Journal of the American Society for Information Science and Technology, 54(6), 550–560.
Burrell, Q. L. (1980). A simple stochastic model for library loans. Journal of Documentation, 36(2), 115–132.
Burrell, Q. L. (1990). Using the gamma-Poisson model to predict library circulations. Journal of the American Society for Information Science, 41(3), 164–170.
Burrell, Q. L. (1992). A simple model for linked informetric processes. Information Processing and Management, 28(5), 637–645.
Burrell, Q. L. (2001). Stochastic modelling of the first-citation distribution. Scientometrics, 52, 3–12.
Burrell, Q. L. (2002a). On the nth-citation distribution and obsolescence. Scientometrics, 53, 309–323.
Burrell, Q. L. (2002b). Will this paper ever be cited? Journal of the American Society for Information Science and Technology, 53, 232–235.
Burrell, Q. L. (2003). Predicting future citation behavior. Journal of the American Society for Information Science and Technology, 54(5), 372–378.
Burrell, Q. L. (2005a). The use of the generalised Waring process in modelling informetric data. Scientometrics, 64(3), 247–270.
Burrell, Q. L. (2005b). Are “Sleeping Beauties” to be expected? Scientometrics, 65(3), 381–389.
Burrell, Q. L. (2007a). Hirsch’s h-index: a stochastic model. Journal of Informetrics, 1(1), 16–25.
Burrell, Q. L. (2007b). Hirsch index or Hirsch rate? Some thoughts arising from Liang’s data. Scientometrics, 73(1), 19–28.
Burrell, Q. L. (2007c). On the h-index, the size of the Hirsch core and Jin’s A-index. Journal of Informetrics, 1(2), 170–177.
Burrell, Q. L. (2007d). Hirsch’s h-index and Egghe’s g-index. In D. Torres-Salinas & H. F. Moed (Eds.), Proceedings of ISSI 2007 (Vol. 1, pp. 162–169). Madrid: Centre for Scientific Information and Documentation (CINDOC).
Burrell, Q. L. (2007e). On Hirsch’s h, Egghe’s g and Kosmulski’s h(2). Scientometrics, 79(1), 79–91.
Burrell, Q. L. (2008). Extending Lotkaian informetrics. Information Processing and Management, 44(5), 1794–1807.
Burrell, Q. L. (2009). The publication/citation process at the micro level: A case study. Journal of Scientometrics and Information Management, 3(1), 71–77.
Burrell, Q. L. (2012). Alternative thoughts on uncitedness. Journal of the American Society for Information Science and Technology, 63(7), 1466–1470.
Burrell, Q. L. (2013). Formulae for the h-index: A lack of robustness in Lotkaian informetrics? Journal of the American Society for Information Science and Technology, (Accepted for publication).
Burrell, Q. L., & Cane, V. R. (1982). The analysis of library data. (With discussion.). Journal of the Royal Statistical Society (Series A), 145(4), 439–471.
Burrell, Q. L., & Fenton, M. R. (1993). Yes, the GIGP really does work—and is workable! Journal of the American Society for Information Science, 44(2), 61–69.
Egghe, L. (2006). Theory and practice of the g-index. Scientometrics, 69(1), 131–152.
Egghe, L. (2010). The Hirsch index and related impact measures. In B. Cronin (Ed.), Annual Review of Information Science and Technology (Vol. 44, pp. 65–114). Medford, NJ: Information Today.
Egghe, L., Guns, R., & Rousseau, R. (2011). Thoughts on uncitedness: Nobel laureates and Fields Medalists as case studies. Journal of the American Society for Information Science and Technology, 62(8), 1637–1644.
Egghe, L., & Rousseau, R. (2006). An informetric model for the Hirsch-index. Scientometrics, 69(1), 121–129.
Egghe, L., & Rousseau, R. (2012a). Theory and practice of the shifted Lotka function. Scientometrics, 91(1), 295–301.
Egghe, L., & Rousseau, R. (2012b). The Hirsch index of a shifted Lotka function and its relation with the impact factor. Journal of the American Society for Information Science and Technology, 63(5), 1048–1053.
Etkowitz, H., & Leydesdorff, L. (2000). The dynamics of innovation: from National Systems and Mode 2 to a Triple Helix of university-industry-government relations. Research Policy, 29(2), 109–123.
Morse, P. M. (1976). Demand for library materials: an exercise in probability analysis. Collection Management, 1, 47–78.
Ross, S. (1996). Stochastic processes (2nd ed.). New York: John Wiley.
Schubert, A., Glanzel, W., & Braun, T. (1989). Scientometric datafiles—A comprehensive set of indicators on 2649 journals and 96 countries in all major science fields and subfields 1981–1985. Scientometrics, 16(1–6), 3–478.
Sichel, H. S. (1985). A bibliometric distribution which really works. Journal of the American Society for Information Science, 36, 314–321.
Stirzaker, D. (2005). Stochastic processes and models. Oxford: Oxford University Press.
Xekalaki, E., & Zografi, M. (2008). The generalized Waring process and its applications. Communications in Statistics - Theory and Methods, 37, 1835–1854.
Zografi, M., & Xekalaki, E. (2001). The generalised Waring process. In E. A. Lypitakis (Ed.), Proceedings of the 5th Hellenic-European Conference on Computer Mathematics and its Applications, Athens (pp. 886–893). Athens: HERCMA.
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
In the proofs we make free use of standard results regarding Poisson processes. The reader can refer to such standard references as Ross (1996) and Stirzaker (2005).
Proof of Proposition 2
The proof is constructed by conditioning on T, the time since publication, and Λ, the citation rate of a paper.
Given T and Λ, we have that citations following a Poisson distribution with mean ΛT so that the pgf is given by
Now averaging over T, which has a uniform distribution on [0, t], we find
Finally averaging over Λ, which has a gamma distribution as in (2), we get
(recognising each of the integrands as being proportional to a gamma pdf)
(where s = t/α)
where, as before, \( p = \frac{1}{1 + s} = \frac{\alpha }{\alpha + t} = 1 - q \)
For the power series representation, this follows from expanding the first term inside the square brackets of (A2) using the general binomial expansion for negative powers and then simplifying. Thus
Now dividing through by \( (\nu - 1)s(z - 1) \) gives the result.□
Remark
The power series representation could alternatively have been derived by expanding (A1) in standard power series form and then integrating term by term with respect to the gamma pdf of Λ.
Proof of Proposition 3
This is an application of a well-known result for probability generating functions. If the pgf G(z) of a random variable X is differentiated wrt z r times then
so that \( G^{(r)} (1) = E\left[ {X(X - 1) \ldots (X - r + 1)} \right] \)
Thus successively differentiating the power series expansion (7) and putting z = 1 each time gives the result.□
Rights and permissions
About this article
Cite this article
Burrell, Q.L. The individual author’s publication–citation process: theory and practice. Scientometrics 98, 725–742 (2014). https://doi.org/10.1007/s11192-013-1018-4
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-013-1018-4