Skip to main content
Log in

Using ‘core documents’ for the representation of clusters and topics

Scientometrics Aims and scope Submit manuscript

Abstract

The notion of ‘core documents’, first introduced in the context of co-citation analysis and later re-introduced for bibliographic coupling, refers to the representation of the core of a publication set according to given criteria. In the present study, the notion of core documents is extended to the combination of citation-based and textual links. It is shown that core documents defined this way can be used to represent and describe document clusters and topics at different levels of aggregation. Methodology is illustrated using the example of two ISI Subject Categories selected from applied and social sciences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Batagelj, V., & Mrvar, A. (2003). Pajek—analysis and visualization of large networks. In M. Jünger & P. Mutzel (Eds.), Graph drawing software (pp. 77–103). New York: Springer.

    Google Scholar 

  • Boyack, K.W., & Klavans, R. (2010). Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? Journal of the American Society for Information Science and Technology, 61(12), 2389–2404.

    Article  Google Scholar 

  • Braam, R. R., Moed, H. F., & van Raan, A. F. J. (1991a). Mapping of science by combined cocitation and word analysis, Part 1: Structural aspects. Journal of the American Society for Information Science, 42(4), 233–251.

    Article  Google Scholar 

  • Braam, R. R., Moed, H. F., & van Raan, A. F. J. (1991b). Mapping of science by combined cocitation and word analysis, Part II: Dynamical aspects. Journal of the American Society for Information Science, 42(4), 252–266.

    Article  Google Scholar 

  • Glänzel, W., & Czerwon, H. J. (1996). A new methodological approach to bibliographic coupling and its application to the national, regional and institutional level. Scientometrics, 37, 195–221.

    Article  Google Scholar 

  • Glenisson, P., Glänzel, W., Janssens, F., & de Moor, B. (2005). Combining full text and bibliometric information in mapping scientific disciplines. Information Processing & Management, 41(6), 1548–1572.

    Article  Google Scholar 

  • Janssens, F., Zhang, L., & Glänzel, W. (2009). Hybrid clustering for validation and improvement of subject-classification schemes. Information Processing & Management, 45(6), 683–702.

    Article  Google Scholar 

  • Lamirel, J.C., Ta A.P., & Attik, M. (2008), Novel labeling strategies for hierarchical representation of multidimensional data analysis results. In: A. Gammerman (Ed.), Proceedings of the 26th IASTED International Conference on Artificial Intelligence and Applications (Track 595-138, pp. 169–174), 11–13 Feb 2008, Innsbruck, Austria. Anaheim, CA: ACTA Press.

  • Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53–65.

    Article  MATH  Google Scholar 

  • Sen, S. K., & Gan, S. K. (1983). A mathematical extension of the idea of bibliographic coupling and its applications. Annals of Library Science and Documentation, 30, 78–82.

    Google Scholar 

  • Small, H. (1973). Cocitation in scientific literature—new measure of relationship between 2 documents. JASIS, 24(4), 265–269.

    Article  Google Scholar 

  • Zhang, L., Glänzel, W., & Liang, L. (2009). Tracing the role of individual journals in a cross-citation network based on different indicators. Scientometrics, 81(3), 821–838.

    Article  Google Scholar 

  • Zitt, M., & Bassecoulard, E. (1994). Development of a method for detection and trend analysis of research fronts built by lexical or cocitation analysis. Scientometrics, 30(1), 333–351.

    Article  Google Scholar 

Download references

Acknowledgments

Methodology has partially been developed in the context of the ERACEP project within the Coordination and Support Actions (CSAs) of the ERC work programme. The authors wish to acknowledge this support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wolfgang Glänzel.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Glänzel, W., Thijs, B. Using ‘core documents’ for the representation of clusters and topics. Scientometrics 88, 297–309 (2011). https://doi.org/10.1007/s11192-011-0347-4

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-011-0347-4

Keywords

Navigation