The limited value of cophenetic correlation as a clustering criterion

https://doi.org/10.1016/0031-3203(78)90038-9

Abstract

A common procedure for evaluating hierarchical cluster techniques is to compare the input data, for example a matrix of similarities or dissimilarities, with the output hierarchy expressed in matrix form. If an ordinary product-moment correlation is used for this comparison, the technique is known as cophenetic correlation and is frequently used by numerical taxonomists. A high correlation between the input similarities and the output dendrogram has been regarded as a criterion of a successful classification. This paper contains a Monte Carlo study of the characteristics of the cophenetic correlation and a related measure of agreement, both of which have been interpreted in terms of generalized variance for several hierarchical cluster algorithms. The generalized variance criterion chosen for this study is Wilks' lambda, whose sampling distribution under the null hypothesis of identical group centroids is used here to define the degree of separation between clusters. Thus, a probabilistic approach is introduced into the evaluation procedure. With this definition of the presence of clusters, the use of the cophenetic correlation and related measures of agreement as goodness-of-fit criteria is shown to be quite misleading in most cases, because these measures vary widely when the separation between clusters is low.
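
The quantities named in the abstract can be illustrated with a minimal sketch, assuming NumPy and SciPy are available. The two-group Gaussian design and the helper names (make_two_groups, cophenetic_correlation, wilks_lambda, simulate) are illustrative assumptions, not the simulation design or code of the paper. The sketch computes the cophenetic correlation as the product-moment correlation between the input dissimilarities and the dendrogram's cophenetic distances, and Wilks' lambda as det(W) / det(T) for a known two-group partition.

```python
import numpy as np
from scipy.cluster.hierarchy import cophenet, linkage
from scipy.spatial.distance import pdist


def make_two_groups(n_per_group, shift, dim=2, seed=0):
    """Two unit-variance Gaussian groups; centroids differ by `shift` on axis 0.

    Hypothetical test design, chosen only for illustration.
    """
    rng = np.random.default_rng(seed)
    a = rng.normal(size=(n_per_group, dim))
    b = rng.normal(size=(n_per_group, dim))
    b[:, 0] += shift
    return np.vstack([a, b]), np.repeat([0, 1], n_per_group)


def cophenetic_correlation(X, method="average"):
    """Correlation between input distances and dendrogram (cophenetic) distances."""
    d = pdist(X)                   # condensed Euclidean dissimilarity matrix
    Z = linkage(d, method=method)  # agglomerative hierarchy
    c, _ = cophenet(Z, d)          # c is the cophenetic correlation coefficient
    return c


def wilks_lambda(X, labels):
    """Wilks' lambda = det(W) / det(T): within-group over total scatter."""
    centered = X - X.mean(axis=0)
    T = centered.T @ centered
    W = np.zeros_like(T)
    for g in np.unique(labels):
        Xg = X[labels == g]
        cg = Xg - Xg.mean(axis=0)
        W += cg.T @ cg
    return np.linalg.det(W) / np.linalg.det(T)


def simulate(shift, n_rep=200, n_per_group=15, method="average"):
    """Monte Carlo mean and standard deviation of the cophenetic correlation."""
    cs = np.array([
        cophenetic_correlation(make_two_groups(n_per_group, shift, seed=s)[0], method)
        for s in range(n_rep)
    ])
    return cs.mean(), cs.std()


if __name__ == "__main__":
    for shift in (0.0, 2.0, 6.0):
        X, y = make_two_groups(15, shift, seed=0)
        for method in ("single", "complete", "average"):
            mean_c, sd_c = simulate(shift, method=method)
            print(f"shift={shift} {method:8s} mean c={mean_c:.3f} sd={sd_c:.3f} "
                  f"lambda(seed 0)={wilks_lambda(X, y):.3f}")
```

Running the script prints, for each separation and linkage method, the simulated mean and spread of the coefficient next to Wilks' lambda for one sample, so the variability of the coefficient can be inspected at low and high separation.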

References (22)

  • M.R. Anderberg, Cluster Analysis for Applications (1973)
  • J.W. Carmichael et al., Finding natural clusters, Syst. Zool. (1968)
  • R.O. Duda et al., Pattern Classification and Scene Analysis (1973)
  • J.S. Farris, On the cophenetic correlation coefficient, Syst. Zool. (1969)
  • W.D. Fisher, On grouping for maximum homogeneity, J. Am. Statist. Assoc. (1958)
  • H.P. Friedman et al., On some invariant criteria for grouping data, J. Am. Statist. Assoc. (1967)
  • J.C. Gower, Statistical methods for comparing different multivariate analyses of the same data
  • J.C. Gower, Classification problems, Bull. Int. Statist. Inst. (1973)
  • J.C. Gower et al., Goodness-of-fit criteria in cluster analysis and their empirical distributions
  • J.A. Hartigan, Representation of similarity matrices by trees, J. Am. Statist. Assoc. (1967)
  • M. Holgersson, An interpretation of two measures of agreement between the dissimilarity matrix and the resulting hierarchy in terms of generalized variance for some different cluster algorithms

Cited by (21)

    • Assessing the efficiency of clustering algorithms and goodness-of-fit measures using phytoplankton field data

      2012, Ecological Informatics
      Citation Excerpt:

      The cophenetic correlation coefficient c based on Pearson's correlation (Legendre and Legendre, 1998; Sokal and Rohlf, 1962) is currently the most commonly used approach (e.g. Blackburn et al., 2005; Kreft and Jetz, 2010; May, 1999) for assessing the most faithful dendrogram (i.e. the dendrogram that preserves most faithfully D). However, it is known that c may not always be a reliable measure of distortion generated by algorithms (Farris, 1969; Holgersson, 1978; Mérigot et al., 2010). For instance, Mérigot et al. (2010) raised three criticisms on the reliability of information given by c measure: (i) it is only a measure of intensity of the monotonic linear relationship between D and U; (ii) it is sensitive to extreme values; and (iii) a c close to 1 indicates a perfect correspondence of D and U whereas the match between the two matrices can in fact be weak.

    • Vowel systems and accent similarity in the British Isles: Exploiting multidimensional acoustic distances in phonetics

      2010, Journal of Phonetics
      Citation Excerpt:

      Average linkage is known to be more robust than other standard methods such as single, complete linkage or Ward's method (Everitt, Landau, & Leese, 2001, pp. 59–64), and it was successfully used by Huckvale (2007) in a study whose principles are very close to our own, although Huckvale concentrated on clustering accents only, not vowels. Incidentally, among standard agglomeration methods, average linkage is also known to yield the highest cophenetic correlation coefficients (Holgersson, 1978, p. 291). The cophenetic correlation coefficient measures the correlation (the closer to 1, the stronger the positive correlation) between the original distance matrix and the corresponding distances as they appear in the tree.

    • The uses and benefits of cluster analysis in pharmacy research

      2008, Research in Social and Administrative Pharmacy
      Citation Excerpt:

      The cophenetic correlation compares the dendrogram results from CA to the structure of the original dataset. However, the use of the cophenetic correlation has been discouraged [47, 55], and it is only applicable to hierarchical CA. Finally, the proposed cluster solution should be subjected to a form of replication.

    • Regionalization of watersheds by hybrid-cluster analysis

      2006, Journal of Hydrology
      Citation Excerpt:

      In essence, CPCC appears ineffective in suggesting optimal partitioning scheme for the Indiana dataset. Detailed discussion on the performance of CPCC can be found in Holgersson (1978). The average silhouette width (ASW), which has a feasible range from −1 to +1, varied generally within a narrow range of 0.31–0.46 for the Indiana data over the variety of clustering options considered.

    • Lattice-based similarity measures between ordered trees

      1982, Journal of Mathematical Psychology