Abstract
Human factors researchers often collect qualitative data that involve statements about a system or tool. Establishing consistent patterns in such data is important for making conclusions about the data. When a theoretically motivated coding scheme has not been established, one might use card sorting techniques to have independent raters generate a similarity space in order to create a bottom-up taxonomy. In this paper, we will explore how clustering and scaling techniques can be used to derive a common taxonomy from multiple car sorting results and judge how consistent the groupings are. We examine this process on two datasets, one with the qualitative data from an interview study with physicians and another with the data regarding a website design for usability purposes. Results showed that the different clustering methods had very similar high-level results, and these had high within-group similarity across groups, suggesting inter-rater reliability. Finally, we will discuss the benefits of different algorithms for clustering and scaling and propose measures to assess strong consensus and inter-group sorting reliability.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Paea, S., Baird, R.: Information architecture (IA): using multidimensional scaling (MDS) and K-Means clustering algorithm for analysis of card sorting data. J. Usability Stud. 13(3) (2018)
Paul, C.L.: Analyzing card-sorting data using graph visualization. J. Usability Stud. 9(3) (2014)
Katsanos, C., Tselios, N., Avouris, N., Demetriadis, S., Stamelos, I., Angelis, L.: Cross-study reliability of the open card sorting method. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, pp. 1–6 (2019)
Nawaz, A.: A comparison of card-sorting analysis methods. In: 10th Asia Pacific Conference on Computer Human Interaction (Apchi 2012), Matsue-city, Shimane, Japan, pp. 28–31 (2012)
Braun, V., Clarke, V.: Using thematic analysis in psychology. Qual. Res. Psychol. 3, 77–101 (2006)
Guest, G., MacQueen, K.M., Namey, E.E.: Applied Thematic Analysis. Sage publications, Thousand Oaks (2011)
Maechler, M., Rousseeuw, P., Struyf, A., Hubert, M., Hornik, K., Studer, M.: Package ‘cluster.’ Dosegljivo Na. (2013)
Fraley, C., Raftery, A.E., Scrucca, L., Murphy, T.B., Fop, M., Scrucca, M.L.: Package ‘mclust.’ (2012)
Ripley, B., et al.: Package ‘mass.’ Cran R. 538 (2013)
Mueller, S.T., Veinott, E.S.: Cultural mixture modeling: Identifying cultural consensus (and disagreement) using finite mixture modeling. In: Proceedings of the Cognitive Science Society (2008)
Tan, Y.-Y., Mueller, S.T.: Adapting cultural mixture modeling for continuous measures of knowledge and memory fluency. Behav. Res. Methods 48(3), 843–856 (2015). https://doi.org/10.3758/s13428-015-0670-4
Romney, A.K., Weller, S.C., Batchelder, W.H.: Culture as consensus: a theory of culture and informant accuracy. Am. Anthropol. 88, 313–338 (1986)
Gamer, M., Lemon, J., Gamer, M.M., Robinson, A., Kendall’s, W.: Package ‘irr.’ Var. Coeff. Interrater Reliab. Agreem. (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Alam, L., Mueller, S.T. (2021). Assessing Clustering Methods to Establish Reliability and Consensus in Card Sorting Tasks. In: Ahram, T.Z., Falcão, C.S. (eds) Advances in Usability, User Experience, Wearable and Assistive Technology. AHFE 2021. Lecture Notes in Networks and Systems, vol 275. Springer, Cham. https://doi.org/10.1007/978-3-030-80091-8_114
Download citation
DOI: https://doi.org/10.1007/978-3-030-80091-8_114
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-80090-1
Online ISBN: 978-3-030-80091-8
eBook Packages: EngineeringEngineering (R0)