Note: t for Two (Clusters)

Journal of Classification

Abstract

Cluster analysis is usually computed by iterative algorithms. Here, a straightforward, non-iterative procedure is presented for clustering in the special case of one variable and two groups. The method is univariate but may reasonably be applied to multivariate datasets when the first principal component or a single factor explains much of the variation in the data. The t method is motivated by two facts: minimizing the within-groups sum of squares is equivalent to maximizing the between-groups sum of squares, and Student’s t statistic measures the between-groups difference in means relative to within-groups variation. That is, the t statistic is the difference in sample means divided by the standard error of this difference. Maximizing the t statistic is therefore developed as a method for clustering univariate data into two clusters. In this situation, the t method gives the same results as the K-means algorithm. K-means tacitly assumes equality of variances; with t, however, equality of variances need not be assumed, because separate variances may be used in computing t. The t method is applied to some datasets, and the results are compared with those obtained by fitting mixtures of distributions.
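The idea in the abstract can be illustrated with a minimal sketch. It is not the article's own code, and it rests on several assumptions: the data are sorted, every cut point between adjacent values is tried, each group must hold at least two observations so that a sample variance exists, and the cut with the largest t is kept. The function name `t_split` and the parameters `pooled` and `min_size` are hypothetical, chosen only for this illustration.

```python
import math
from statistics import mean, variance


def t_split(values, pooled=False, min_size=2):
    """Split univariate data into two groups by maximizing a two-sample t.

    Illustrative sketch only. `pooled` and `min_size` are hypothetical
    parameters: pooled=True uses the equal-variance t statistic,
    pooled=False uses separate variances (Welch form).
    """
    xs = sorted(values)
    n = len(xs)
    if min_size < 2 or n < 2 * min_size:
        raise ValueError("need min_size >= 2 and at least 2 * min_size values")

    best = None
    # Single pass over the candidate cut points; no iteration to convergence.
    for k in range(min_size, n - min_size + 1):
        lo, hi = xs[:k], xs[k:]
        n1, n2 = len(lo), len(hi)
        diff = mean(hi) - mean(lo)
        v1, v2 = variance(lo), variance(hi)  # sample variances
        if pooled:
            sp2 = ((n1 - 1) * v1 + (n2 - 1) * v2) / (n1 + n2 - 2)
            se = math.sqrt(sp2 * (1 / n1 + 1 / n2))
        else:
            se = math.sqrt(v1 / n1 + v2 / n2)
        if se == 0:
            # Both groups are constant: treat a nonzero gap as infinite t.
            t = math.inf if diff > 0 else 0.0
        else:
            t = diff / se
        if best is None or t > best[0]:
            best = (t, lo, hi)
    return best


if __name__ == "__main__":
    data = [1.0, 1.2, 0.9, 1.1, 5.0, 5.2, 4.9, 5.1]
    t, low_group, high_group = t_split(data)
    print(low_group, high_group)
```

On the toy data in the usage example, the separate-variance and pooled versions choose the same cut, which is what one would expect for two well-separated groups of similar spread. The two versions may differ when the groups have clearly unequal variances.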

Author information

Corresponding author

Correspondence to Stanley L. Sclove.

Additional information

“Tea for Two” is a well-known song from the 1925 musical “No, No, Nanette”, music by Vincent Youmans, lyrics by Irving Caesar.

About this article

Cite this article

Sclove, S.L. Note: t for Two (Clusters). J Classif 36, 435–441 (2019). https://doi.org/10.1007/s00357-019-09335-3

  • DOI: https://doi.org/10.1007/s00357-019-09335-3
