Paper
20 January 2009 Visually comparing multiple partitions of data with applications to clustering
Author Affiliations +
Proceedings Volume 7243, Visualization and Data Analysis 2009; 72430J (2009) https://doi.org/10.1117/12.810093
Event: IS&T/SPIE Electronic Imaging, 2009, San Jose, California, United States
Abstract
Tightly coupled visualization and analysis is a powerful approach to data exploration especially for clustering. We describe such a specific integration of analysis and visualization for the evaluation of multiple partitions of a data set. Partitions are decompositions of a dataset into a family of disjoint subsets. They may be the results of clustering, of groupings of categorical dimensions, of binned numerical dimensions, of predetermined class labeling dimensions, or of prior knowledge structured in mutually exclusive format (one data item associated with one and only one outcome). Partition or cluster stability analysis can be used to identify near-optimal structures, build ensembles, or conduct validation. We extend Parallel Sets to a new visualization tool which provides for the mutual comparison and evaluation of multiple partitions of the same dataset. We describe a novel layout algorithm for informatively rearranging the order of records and dimensions. We provide examples of its application to data stability and correlation at the record, cluster, and dimension levels within a single interactive display.
© (2009) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jianping Zhou, Shawn Konecni, and Georges Grinstein "Visually comparing multiple partitions of data with applications to clustering", Proc. SPIE 7243, Visualization and Data Analysis 2009, 72430J (20 January 2009); https://doi.org/10.1117/12.810093
Lens.org Logo
CITATIONS
Cited by 18 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Visual analytics

Data modeling

Remote sensing

Copper

Algorithm development

Data analysis

Back to Top