skip to main content
10.1145/3105971.3105979acmotherconferencesArticle/Chapter ViewAbstractPublication PagesvinciConference Proceedingsconference-collections
research-article

Towards Glyph-based visualizations for big data clustering

Published: 14 August 2017 Publication History

Abstract

Data Analysts have to deal with an ever-growing amount of data resources. One way to make sense of this data is to extract features and use clustering algorithms to group items according to a similarity measure. Algorithm developers are challenged when evaluating the performance of the algorithm since it is hard to identify features that influence the clustering. Moreover, many algorithms can be trained using a semi-supervised approach, where human users provide ground truth samples by manually grouping single items. Hence, visualization techniques are needed that help data analysts achieve their goal in evaluating Big data clustering algorithms. In this context, Multidimensional Scaling (MDS) has become a prominent visualization tool. In this paper, we propose a combination with glyphs that can provide a detailed view of specific features involved in MDS. In consequence, human users can understand, adjust, and ultimately improve clustering algorithms. We present a thorough glyph design, which is founded in a comprehensive survey of related work and report the results of a controlled experiments, where participants solved data analysis tasks with both glyphs and a traditional textual display of data values.

References

[1]
P. Berkhin. 2006. A survey of clustering data mining techniques. In Grouping multidimensional data. Springer, 25--71.
[2]
E. Bertini and D. Lalanne. 2009. Surveying the Complementary Role of Automatic Data Analysis and Visualization in Knowledge Discovery. In Proc. of the ACM SIGKDD Workshop on Visual Analytics and Knowledge Discovery: Integrating Automated Analysis with Interactive Exploration (VAKD '09). ACM, New York, NY, USA, 12--20.
[3]
E. Bertini and D. Lalanne. 2010. Investigating and Reflecting on the Integration of Automatic Data Analysis and Visualization in Knowledge Discovery. SIGKDD Explor. Newsl. 11, 2 (May 2010), 9--18.
[4]
R. Borgo, J. Kehrer, D.H.S Chung, E. Maguire, R.S. Laramee, H. Hauser, M. Ward, and M. Chen. 2013. Glyph-based Visualization: Foundations, Design Guidelines, Techniques and Applications. Eurographics State of the Art Reports (May 2013), 39--63. https://www.cg.tuwien.ac.at/research/publications/2013/borgo-2013-gly/
[5]
M. Chau. 2011. Visualizing Web Search Results Using Glyphs: Design and Evaluation of a Flower Metaphor. ACM Trans. Manage. Inf. Syst. 2, 1, Article 2 (March 2011), 27 pages.
[6]
H. Chernoff. 1973. The use of faces to represent points in k-dimensional space graphically. J. Amer. Statist. Assoc. 68 (1973), 361--368.
[7]
D.H.S. Chung, D. Archambault, R. Borgo, D.J. Edwards, R.S. Laramee, and M. Chen. 2016. How Ordered is It?: On the Perceptual Orderability of Visual Channels. In Proc. of the Eurographics/IEEE VGTC Conference on Visualization (EuroVis '16). Eurographics Association, Goslar Germany, Germany, 131--140.
[8]
M. de Almeida Madeira Clemente, M. Keck, and R. Groh. 2014. TagStar: A Glyph-based Interface for Indexing and Visual Analysis. In Proc. of the 2014 International Working Conference on Advanced Visual Interfaces (AVI '14). ACM, New York, NY, USA, 357--358.
[9]
V. Estivill-Castro. 2002. Why so many clustering algorithms: a position paper. ACM SIGKDD explorations newsletter 4, 1 (2002), 65--75.
[10]
M. C. Ferreira de Oliveira and H. Levkowitz. 2003. From Visual Data Exploration to Visual Data Mining: A Survey. IEEE Trans. on Visualization and Computer Graphics 9, 3 (Jul. 2003), 378--394.
[11]
J. Fuchs, F. Fischer, F. Mansmann, E. Bertini, and P. Isenberg. 2013. Evaluation of Alternative Glyph Designs for Time Series Data in a Small Multiple Setting. In Proc. of the SIGCHI Conference on Human Factors in Computing Systems (CHI '13). ACM, New York, NY, USA, 3237--3246.
[12]
J. Han and M. Kamber. 2000. Data mining: concepts and techniques (the Morgan Kaufmann Series in data management systems). (2000).
[13]
H. Hotelling. 1933. Analysis of a Complex of Statistical Variables with Principal Components. Journal of Educational Psychology 24 (1933), 417--441.
[14]
Daniel A. Keim. 2000. Designing Pixel-Oriented Visualization Techniques: Theory and Applications. IEEE Trans. on Visualization and Computer Graphics 6, 1 (Jan. 2000), 59--78.
[15]
C. Kintzel, J. Fuchs, and F. Mansmann. 2011. Monitoring Large IP Spaces with ClockView. In Proc. of the 8th International Symposium on Visualization for Cyber Security (VizSec '11). ACM, New York, NY, USA, Article 2, 10 pages.
[16]
A. Kirk. 2016. Data Visualisation: A Handbook for Data Driven Design. Sage Publications Ltd. 368 pages.
[17]
B. Laugwitz, T. Held, and M. Schrepp. 2008. Construction and Evaluation of a User Experience Questionnaire. Springer Berlin Heidelberg, Berlin, Heidelberg, 63--76.
[18]
J. A. Lee and M. Verleysen. 2007. Nonlinear dimensionality reduction. Springer Science & Business Media.
[19]
J. Mackinlay. 1986. Automating the Design of Graphical Presentations of Relational Information. ACM Trans. Graph. 5, 2 (April 1986), 110--141.
[20]
T. Ropinski, S. Oeltze, and B. Preim. 2011. Survey of Glyph-based Visualization Techniques for Spatial Multivariate Medical Data. Computers & Graphics (March/April 2011). http://viscg.uni-muenster.de/publications/2011/ROP11
[21]
B. Shneiderman. 1996. The Eyes Have It: A Task by Data Type Taxonomy for Information Visualizations. In Proc. of the 1996 IEEE Symposium on Visual Languages (VL '96). IEEE Computer Society, Washington, DC, USA, 336-. http://dl.acm.org/citation.cfm?id=832277.834354
[22]
W.S. Torgerson. 1952. Multidimensional scaling: I. Theory and method. Psychometrika 17 (1952), 401--419.
[23]
L. van der Maaten and G.E. Hinton. 2008. Visualizing High-Dimensional Data Using t-SNE. Journal of Machine Learning Research 9 (2008), 2579--2605.
[24]
M.O. Ward. 2008. Multivariate Data Glyphs: Principles and Practice. Springer Berlin Heidelberg, Berlin, Heidelberg, 179--198.
[25]
C. Ware. 2012. Information Visualization: Perception for Design (3 ed.). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
[26]
R. Xiong and J. Donath. 1999. PeopleGarden: Creating Data Portraits for Users. In Proc. of the 12th Annual ACM Symposium on User Interface Software and Technology (UIST '99). ACM, New York, NY, USA, 37--44.

Cited By

View all
  • (2024)Visual analysis of fitness landscapes in architectural design optimizationThe Visual Computer10.1007/s00371-024-03491-340:7(4927-4940)Online publication date: 17-Jun-2024
  • (2023)Out of the Plane: Flower versus Star Glyphs to Support High-Dimensional Exploration in Two-Dimensional EmbeddingsIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.321691929:12(5468-5482)Online publication date: Dec-2023
  • (2022)Sparkle Glyphs: A Glyph Design for the Analysis of Temporal Multivariate Audio FeaturesProceedings of the 2022 International Conference on Advanced Visual Interfaces10.1145/3531073.3534491(1-3)Online publication date: 6-Jun-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
VINCI '17: Proceedings of the 10th International Symposium on Visual Information Communication and Interaction
August 2017
158 pages
ISBN:9781450352925
DOI:10.1145/3105971
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

  • KMUTT: King Mongkut's University of Technology Thonburi

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Glyph-based visualization techniques
  2. big data
  3. multidimensional scaling
  4. visual cluster analysis

Qualifiers

  • Research-article

Funding Sources

  • Free State of Saxony
  • European Regional Development Fund

Conference

VINCI '17
Sponsor:
  • KMUTT

Acceptance Rates

VINCI '17 Paper Acceptance Rate 12 of 27 submissions, 44%;
Overall Acceptance Rate 71 of 193 submissions, 37%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)27
  • Downloads (Last 6 weeks)2
Reflects downloads up to 22 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Visual analysis of fitness landscapes in architectural design optimizationThe Visual Computer10.1007/s00371-024-03491-340:7(4927-4940)Online publication date: 17-Jun-2024
  • (2023)Out of the Plane: Flower versus Star Glyphs to Support High-Dimensional Exploration in Two-Dimensional EmbeddingsIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2022.321691929:12(5468-5482)Online publication date: Dec-2023
  • (2022)Sparkle Glyphs: A Glyph Design for the Analysis of Temporal Multivariate Audio FeaturesProceedings of the 2022 International Conference on Advanced Visual Interfaces10.1145/3531073.3534491(1-3)Online publication date: 6-Jun-2022
  • (2021)A data science approach to drug safety: Semantic and visual mining of adverse drug events from clinical trials of pain treatmentsArtificial Intelligence in Medicine10.1016/j.artmed.2021.102074115(102074)Online publication date: May-2021
  • (2020)Glyphboard: Visual Exploration of High-Dimensional Data Combining Glyphs with Dimensionality ReductionIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2020.296906026:4(1661-1671)Online publication date: 1-Apr-2020
  • (2020)Comparison of four visual analytics techniques for the visualization of adverse drug event rates in clinical trials2020 24th International Conference Information Visualisation (IV)10.1109/IV51561.2020.00063(344-349)Online publication date: Sep-2020
  • (2019)MetricsVis: A Visual Analytics System for Evaluating Employee Performance in Public Safety AgenciesIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2019.2934603(1-1)Online publication date: 2019
  • (2019)Evaluation of Effectiveness of Glyphs to Enhance ChronoView2019 23rd International Conference Information Visualisation (IV)10.1109/IV.2019.00035(157-162)Online publication date: Jul-2019
  • (2018)Big data landscapesProceedings of the 2018 International Conference on Advanced Visual Interfaces10.1145/3206505.3206556(1-3)Online publication date: 29-May-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media