Consensus Clustering

doi:10.1007/978-1-4899-7687-1_162

Synonyms

Clustering aggregation; Clustering ensembles

Definition

In Consensus Clustering we are given a set of n objects V, and a set of m clusterings {C₁, C₂, …, C_m} of the objects in V. The aim is to find a single clustering C that disagrees least with the input clusterings, that is, C minimizes

$$D(C) = \sum_{i=1}^{m} d(C, C_{i}),$$

for some metric d on clusterings of V. Meilă (2003) proposed the principled variation of information metric on clusterings, but it has been difficult to analyze theoretically. The Mirkin metric is the most widely used: d(C, C′) is the number of pairs of objects (u, v) that are clustered together in C and apart in C′, or vice versa. Under this metric, D(C) can be calculated in time O(mn).
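As an illustration of the Mirkin metric and the objective D(C), here is a minimal Python sketch. It assumes each clustering is given as a list of cluster labels indexed by object, and that pairs (u, v) are counted as unordered; the function names are hypothetical and not part of the entry. The counting version relies on hashing the label pairs, so one distance takes time roughly linear in n.

```python
from collections import Counter
from itertools import combinations


def mirkin_distance(a, b):
    """Count unordered pairs that are together in one clustering and apart in the other.

    `a` and `b` are label lists of equal length (an assumed representation).
    """
    if len(a) != len(b):
        raise ValueError("clusterings must cover the same objects")

    def co_clustered_pairs(counts):
        # A cluster of size c contributes c * (c - 1) / 2 co-clustered pairs.
        return sum(c * (c - 1) // 2 for c in counts.values())

    pairs_a = co_clustered_pairs(Counter(a))
    pairs_b = co_clustered_pairs(Counter(b))
    # Pairs that are together in both clusterings share the same (a, b) label pair.
    pairs_both = co_clustered_pairs(Counter(zip(a, b)))
    return pairs_a + pairs_b - 2 * pairs_both


def mirkin_distance_bruteforce(a, b):
    """Direct O(n^2) check of the definition, useful for validating the fast version."""
    return sum(
        (a[u] == a[v]) != (b[u] == b[v])
        for u, v in combinations(range(len(a)), 2)
    )


def total_disagreement(candidate, clusterings):
    """D(C): the sum of Mirkin distances from the candidate to every input clustering."""
    return sum(mirkin_distance(candidate, c) for c in clusterings)
```

For example, `total_disagreement([0, 0, 1, 1], [[0, 0, 1, 1], [0, 0, 0, 1]])` sums one distance per input clustering, so a candidate that matches an input exactly contributes zero for that input.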

We can interpret each of the clusterings C_i in Consensus Clustering as evidence that pairs ought to be put together or separated. That is, w_uv⁺ is the number of C_i in which C_i[u] = C_i[v] and w_uv⁻ is the number of C_...
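The pairwise-evidence view can be sketched in Python as follows. This is a rough illustration that only computes, for each pair (u, v), the number of input clusterings in which u and v share a cluster. The label-list representation and the function name `together_counts` are assumptions, and the second quantity, whose definition is cut off above, is not reconstructed here.

```python
from itertools import combinations


def together_counts(clusterings):
    """Map each unordered pair (u, v), u < v, to the number of clusterings
    in which u and v carry the same cluster label."""
    n = len(clusterings[0])
    if any(len(c) != n for c in clusterings):
        raise ValueError("all clusterings must cover the same objects")
    return {
        (u, v): sum(1 for c in clusterings if c[u] == c[v])
        for u, v in combinations(range(n), 2)
    }
```

This direct version examines every pair in every clustering, so it takes time proportional to m·n²; it is meant to make the definition concrete rather than to be efficient.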

Editor information

Editors and Affiliations

The University of New South Wales, Sydney, NSW, Australia
Claude Sammut
Faculty of Information Technology, Monash University, Melbourne, VIC, Australia
Geoffrey I. Webb

About this entry

Cite this entry

(2017). Consensus Clustering. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_162

DOI: https://doi.org/10.1007/978-1-4899-7687-1_162
Published: 14 April 2017
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7685-7
Online ISBN: 978-1-4899-7687-1
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
