Abstract
Aim
The present article contributes to the current methodological debate concerning author co-citation analyses. (ACA) The study compares two different units of analyses, i.e. first- versus inclusive all-author co-citation counting, as well as two different matrix generation approaches, i.e. a conventional multivariate and the so-called Drexel approach, in order to investigate their influence upon mapping results. The aim of the present study is therefore to provide more methodological awareness and empirical evidence concerning author co-citation studies.
Method
The study is based on structured XML documents extracted from the IEEE collection. These data allow the construction of ad-hoc citation indexes, which enables us to carry out the hitherto largest all-author co-citation study. Four ACA are made, combining the different units of analyses with the different matrix generation approaches. The results are evaluated quantitatively by means of multidimensional scaling, factor analysis, Procrustes and Mantel statistics.
Results
The results show that the inclusion of all cited authors can provide a better fit of data in two-dimensional mappings based on MDS, and that inclusive all-author co-citation counting may lead to stronger groupings in the maps. Further, the two matrix generation approaches produce maps that have some resemblances, but also many differences at the more detailed levels. The Drexel approach produces results that have noticeably lower stress values and are more concentrated into groupings. Finally, the study also demonstrates the importance of sparse matrices and their potential problems in connection with factor analysis.
Conclusion
We can confirm that inclusive all-ACA produce more coherent groupings of authors, whereas the present study cannot clearly confirm previous findings that first-ACA identifies more specialties, though some vague indication is given. Most crucially, strong evidence is given to the determining effect that matrix generation approaches have on the mapping of author co-citation data and thus the interpretation of such maps. Evidence is provided for the seemingly advantages of the Drexel approach.
Similar content being viewed by others
References
Ahlgren, P., Jarneving, B., Rousseau, R. (2003), Requirements for a cocitation similarity measure, with special reference to Pearson’s correlation coefficient. Journal of the American Society for Information Science and Technology, 54(6): 550–560.
Borg, I., Groenen, P. J. F. (2005), Modern Multidimensional Scaling: Theory and applications. 2nd. Springer Science & Business Media: New York.
Boyack, K. (2004), Mapping knowledge domains: Characterizing PNAS. Proceedings of the National Academy of Sciences, 101: 5192–5199.
Chen, C. (1999), Visualising semantic spaces and author co-citation networks in digital libraries. Information Processing & Management, 35(3): 401–420.
Cronbach, L. J. (1951), Coefficient alpha and the internal structure of tests. Psychometrika, 16(3): 297–334.
Eom, S. B. (2003), Author Cocitation Analysis using Custom Bibliographical Databases: An Introduction to the SAS systems. The Edwin Mellen Press, Lewiston, New York.
Giles, C. L., Bollacker, K., Lawrence, S. (1998), CiteSeer: An Automatic Citation Indexing System. In: Third ACM Conference on Digital Libraries. ACM Press, New York, pp. 89–98.
Glänzel, W. (1996), The need for standards in bibliometric research and technology, Scientometrics, 35(2): 167–176.
Gower, J. C. (1971), Statistical methods for comparing different multivariate analyses of the same data. In: Hodson, Kendall, Tautu (Eds), Mathematics in the Archaeological and Historical Sciences. Edinburgh: Edinburgh University Press, pp. 138–149.
Klavans, D., Boyack, K. (2006), Identifying a better measure of relatedness for mapping science. Journal of the American Society for Information Science and Technology, 57(2): 251–263.
Lattin, J., Carroll, J. D., Green, P. E. (2003), Analyzing Multivariate Data. Pacific Grove, CA: Brooks/Cole - Thompson Learning.
Leydesdorff, L., Bensman, S. (2006), Classification and powerlaws: The logarithmic transformation. Journal of the American Society for Information Science and Technology, 57(11): 1470–1486.
Leydesdorff, L., Vaughan, L. (2006), Co-occurrence matrices and their application in information science: Extending ACA to the Web Environment. Journal of the American Society for Information Science and Technology, 57(12): 1616–1628.
Lok, C. K. W, Chan, M. T. W., Martinson, I. M. (2001), Risk factors for citation errors in peer-reviewed nursing journals. Journal of Advanced Nursing, 32(2): 223–229.
Malik, S., Kazai, G., Lalmas, M., Fuhr, N. (2006), Overview of INEX 2005. In: Fuhr & AL.: Proceedings of INEX 2005. Berlin: Springer 2006, pp. 1–15. (LNCS 3977)
Mantel, N. (1967), A technique of disease clustering and a generalized regression approach. Cancer Research, 27: 209–220.
McCain, K. W. (1990), Mapping authors in intellectual space: A technical overview. Journal of the American Society for Information Science, 41(6): 433–443.
Mulaik, S. A. (1972), The foundations of factor analysis. New York: McGraw-Hill.
Nunnally, J. C. (1978), Psychometric theory. 2nd edition. New York: McGraw-Hill.
Persson, O. (2001), All author citations versus first author citations. Scientometrics, 50(2): 339–344.
Price, D. J. de Solla (1981), The analysis of square matrices of scientometric transaction, Scientometrics, 3(1) (1981) 55–63.
Rousseau, R., Zuccala, A. (2004), A classification of author co-citations: definitions and search strategies. Journal of the American Society for Information Science and Technology, 55(6): 513–629.
Schneider, J. W., Borlund, P. (2007a): Matrix comparison, Part 1: Motivation and important issues for measuring the resemblance between proximity measures or ordination results. Journal of the American Society for Information Science and Technology, 58(11): 1586–1595.
Schneider, J. W., Borlund, P. (2007b): Matrix comparison, Part 2: Measuring the resemblance between proximity measures or ordination results by use of the Mantel and Procrustes statistics. Journal of the American Society for Information Science and Technology, 58(11): 1596–1609.
Schneider, J. W., Larsen, B., Ingwersen, P. (2007), Comparative study between first and all-author cocitation analysis based on citation indexes generated from XML data. In: Proceedings of ISSI 2007, 11 th International Conference of the International Society for Scientometrics and Informetrics, Eds. Torres-Salinas, D. & Moed, H. CINCDOC, Madrid, pp. 696–707.
Schönemann, P. H., Carroll, R. M. (1970), Fitting one matrix to another under choice of a central dilation and a rigid motion. Psychometrika, 35(2): 245–256.
Shannon, C. E., Weaver, W. (1949), The Mathematical Theory of Communication. University of Illinois
White, H. D. (2003a), Pathfinder networks and author cocitation analysis: A remapping of paradigmatic information scientists. Journal of the American Society for Information Science and Technology, 54(5): 423–434.
White, H. D. (2003b), Author Cocitation Analysis and Pearson’s r. Journal of the American Society for Information Science and Technology, 54(31): 250–259.
White, H. D., Griffith, B. C. (1981), Author co-citation: A literature measure of intellectual structure. Journal of the American Society for Information Science, 32(3): 163–171.
White, H. D., McCain, K. (1998), Visualizing a discipline: An author cocitation analysis of information science, 1972–1995. Journal of the American Society for Information Science, 49(4): 327–355.
Zhao, D. (2006), Towards all-author co-citation analysis. Information Processing & Management, 42 (6): 1578–1591.
Zhao, D., Strotmann, A. (2007), Can citation analysis of web publications better detect research fronts? Journal of the American Society for Information Science and Technology, 58(9): 1285–1302.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Schneider, J.W., Larsen, B. & Ingwersen, P. A comparative study of first and all-author co-citation counting, and two different matrix generation approaches applied for author co-citation analyses. Scientometrics 80, 103–130 (2009). https://doi.org/10.1007/s11192-007-2019-y
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-007-2019-y