Clustering of Variables with Missing Data: Application to Preference Studies

Sahmer, Karin; Vigneau, Evelyne; El Qannari, Mostafa; Kunert, Joachim

doi:10.1007/3-540-28084-7_22

Part of the book series: Studies in Classification, Data Analysis, and Knowledge Organization (STUDIES CLASS)

Abstract

Clustering of variables around latent components is a means of organizing multivariate data into meaningful subgroups. We extend the approach to situations with missing data. A straightforward method is to replace the missing values by some estimates and cluster the completed data set. This basic imputation method is improved by more sophisticated procedures which update the imputations within each group after an initial clustering of the variables. We compare the performance of the different imputation methods with the help of a simulation study.
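The two-stage idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes that each group's latent component is the per-observation mean of its member variables, that variables are assigned to the component they correlate with most strongly, and that the within-group update shifts a variable's mean by the average deviation of the other variables in the same group. All function names, the seed choice, and the toy data (rows as products, columns as consumers) are hypothetical.

```python
from math import sqrt
from statistics import mean

def column_means(X):
    """Mean of the observed (non-None) values of each variable (column)."""
    return [mean(r[j] for r in X if r[j] is not None) for j in range(len(X[0]))]

def mean_impute(X):
    """Basic imputation: replace each missing value by its column mean."""
    m = column_means(X)
    return [[m[j] if v is None else v for j, v in enumerate(r)] for r in X]

def correlation(u, v):
    """Pearson correlation; returns 0.0 if either vector is constant."""
    mu, mv = mean(u), mean(v)
    suv = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    suu = sum((a - mu) ** 2 for a in u)
    svv = sum((b - mv) ** 2 for b in v)
    return suv / sqrt(suu * svv) if suu > 0 and svv > 0 else 0.0

def cluster_variables(X, seeds, n_iter=20):
    """Partition the variables of a complete matrix X into len(seeds) groups.

    Assumption: the latent component of a group is the row-wise mean of its
    members; each variable joins the group whose component it correlates
    with most. Seeds are column indices used as initial one-member groups.
    """
    p = len(X[0])
    cols = [[r[j] for r in X] for j in range(p)]
    groups, labels = [[s] for s in seeds], None
    for _ in range(n_iter):
        comps = [[mean(r[j] for j in g) for r in X] for g in groups]
        new = [max(range(len(comps)), key=lambda k: correlation(cols[j], comps[k]))
               for j in range(p)]
        if new == labels:
            break
        labels = new
        # Keep the previous members if a group would otherwise become empty.
        groups = [[j for j in range(p) if labels[j] == k] or groups[k]
                  for k in range(len(seeds))]
    return labels

def reimpute_within_groups(X_raw, labels):
    """Update imputations using only the variables of the same group."""
    m = column_means(X_raw)
    X_new = [list(r) for r in X_raw]
    for i, row in enumerate(X_raw):
        for j, v in enumerate(row):
            if v is None:
                devs = [row[k] - m[k] for k in range(len(row))
                        if k != j and labels[k] == labels[j] and row[k] is not None]
                X_new[i][j] = m[j] + (mean(devs) if devs else 0.0)
    return X_new

# Toy data: 5 products (rows) scored by 4 consumers (columns); None = missing.
X = [[1, 1, 5, 5],
     [2, None, 4, 4],
     [3, 3, 3, 3],
     [4, 4, 2, 2],
     [5, 5, 1, 1]]

completed = mean_impute(X)                      # stage 1: basic imputation
labels = cluster_variables(completed, [0, 2])   # initial clustering
updated = reimpute_within_groups(X, labels)     # stage 2: within-group update
```

In this toy example the basic imputation fills the gap with the column mean, whereas the within-group update borrows the below-average score that the other consumer in the same group gave to that product. A further clustering pass on `updated` would complete one cycle of the improved scheme.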

Author information

Authors and Affiliations

Laboratoire de sensométrie et de chimiométrie, ENITIAA / INRA, rue de la Géraudière, BP 82 225, F-44322, Nantes Cedex 03, France
Karin Sahmer, Evelyne Vigneau & Mostafa El Qannari
Fachbereich Statistik, Universität Dortmund, D-44221, Dortmund, Germany
Joachim Kunert

Editor information

Editors and Affiliations

Fachbereich Statistik, Universität Dortmund, 44221, Dortmund
Claus Weihs
Institut für Entscheidungstheorie und Unternehmensforschung, Universität Karlsruhe (TH), 76128, Karlsruhe
Wolfgang Gaul

Cite this paper

Sahmer, K., Vigneau, E., El Qannari, M., Kunert, J. (2005). Clustering of Variables with Missing Data: Application to Preference Studies. In: Weihs, C., Gaul, W. (eds) Classification — the Ubiquitous Challenge. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-28084-7_22

DOI: https://doi.org/10.1007/3-540-28084-7_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25677-9
Online ISBN: 978-3-540-28084-2
eBook Packages: Mathematics and Statistics (R0)