Skip to main content

K-Means and K-Medoids

  • Reference work entry
  • First Online:
Encyclopedia of Database Systems
  • 60 Accesses

Synonyms

CLARA (Clustering LARge Applications); CLARANS (Clustering large applications based upon randomized search); K-means partition; PAM (Partitioning Around Medoids)

Definitions

K-Means

Given an integer k and a set of objects S = {p1, p2,…,pn} in Euclidian space, the problem of k-means clustering is to find a set of centre points (means) P = {c1, c2,…,ck}, |P| = k in the space, such that S can be partitioned into k corresponding clusters C1, C2,…,Ck, by assigning each object in S to the closest centre ci. The sum of square error criterion (SEC) measure, defined as \( {\displaystyle {\sum}_{i=1}^k{\displaystyle \sum_{p\in {C}_i}\Big|p-{c}_i}}\Big|{}^2 \), is minimized.

K-Medoids

Given an integer k and a set of objects S = {p1, p2, …, pn} in Euclidian space, the problem of k-medoids clustering is to find a set of objects as medoids P = {o1, o2,…,ok}, |P| = k in the space, such that S can be partitioned into k corresponding clusters C1, C2,…,Ck, by assigning each object in Sto the...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 4,499.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 6,499.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Kaufman L, Rousseeuw PJ. Finding groups in data: an introduction to cluster analysis. New York: Wiley; 1990.

    Book  Google Scholar 

  2. MacQueen J. Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley Symposium on Mathematics, Statistics and Probabilities; 1967. p. 281–97.

    Google Scholar 

  3. Ng RT, Han J. Efficient and effective clustering methods for spatial data mining. In: Proceedings of the 20th International Conference on Very Large Databases; 1994. p. 144–55.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xue Li .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Li, X. (2018). K-Means and K-Medoids. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_545

Download citation

Publish with us

Policies and ethics