Abstract
The quality, certainty, or confidence of decisions made during the visual analytics process depends on many factors, including the completeness and reliability of the initial data, information loss due to filtering, sampling, and other transformations, and the accuracy and clarity of the visual presentation. Unfortunately, in most visualization tools the analyst is unaware of these and other forces that degrade the meaningfulness of their results. In this paper, we describe our efforts to design strategies for tackling the measurement, display, and utilization of quality aspects at all stages of the visualization pipeline. The goal is to help analysts maintain an awareness of the accuracy and completeness of the information conveyed in the images, and subsequently the patterns observed and decisions made based on the analysis. Quality measures can be used both to assist analysts in selecting, transforming, and mapping their data as well as to automatically refine processes to generate higher quality views. We have implemented several such techniques within XmdvTool, a public-domain package for visual analytics. We illustrate the quality-specific components of our tool with several case studies to show the usefulness of the approach. We also describe user studies that were performed to validate the accuracy of our quality measures.
Similar content being viewed by others
References
Ahlberg C, Shneiderman B (1994) Visual information seeking using the filmfinder. In: Proceedings of the ACM SIGCHI conference on human factors in computing systems, 2:433
Amar R, Stasko J (2004) A knowledge task-based framework for design and evaluation of information visualizations. In: Proceedings of the IEEE symposium on information visualization, pp 143–150
Asuncion A, Newman D (2007) UCI machine learning repository. http://www.ics.uci.edu/~mlearn/MLRepository.html
Bertini E, Santucci G (2004) Quality metrics for 2d scatterplot graphics: automatically reducing visual clutter. In: Proceedings of 4th international symposium on smartGraphics, pp 77–89
Boutin F, Hascoet M (2004) Cluster validity indices for graph partitioning. In: Eighth international conference on information visualisation (IV’04), pp 376–381
Carreira-Perpinan M (1997) A review of dimension reduction techniques. Tech. Rep. CS–96–09, Dept. of Computer Science, University of Sheffield, http://faculty.ucmerced.edu/mcarreira-perpinan/papers/cs-96-09.pdf
Cui Q, Ward M, Rundensteiner E, Yang J (2006) Measuring data abstraction quality in multiresolution visualizations. IEEE Trans Vis Comput Graph 12(5): 709–716
Dix A, Ellis G (2002) By chance—enhancing interaction with large data sets through statistical sampling. In: Proceedings of advanced visual interfaces, pp 167–176
Djurcilov S, Kim K, Lermusiaux P, Pang A (2002) Visualizing scalar volumetric data with uncertainty. Comput Graph 26(2): 239–248
Duda R, Hart P, Stork D (2001) Pattern classification. 2nd edn. Wiley, London
Friendly M, Kwan E (2003) Effect ordering for data displays. Comput Stat Data Anal 43: 509–539
Fua Y, Ward M, Rundensteiner E (2000) Structure-based brushes: a mechanism for navigating hierarchically organized data and information spaces. IEEE Trans Vis Comput Graph 6(2): 150–159
Haase H (1998) Mirror, mirror on the wall, who has the best visualization of all? a reference model for visualization quality. In: Proceedings of visualization in scientific computing ’98, pp 117–128
Hofmann H, Theus M (1998) Selection sequences in MANET. Comput Stat 13(1): 77–88
Hunter G (1999) New tools for handling spatial data quality: moving from academic concepts to practical reality. URISA J 11(2): 25–34
Luo A, Kao D, Pang A (2003) Visualizing spatial distribution data sets. In: VISSYM ’03: proceedings of the symposium on data visualisation 2003, Eurographics Association, Aire-la-Ville, Switzerland, pp 29–38
MacEachren AM (1992) Visualizing uncertain information. Cartogr Perspect 13: 10–19
Martin A, Ward M (1995) High dimensional brushing for interactive exploration of multivariate data. In: Proceedings of the IEEE visualization, pp 271–278
Olston C, Mackinlay J (2002) Visualizing data with bounded uncertainty. In: Proceedings of the IEEE symposium on information visualization, pp 37–40
Pang A (2001) Visualizing uncertainty in geo-spatial data. In: Proceedings of worshop on the intersections between geospatial information and information technology
Pang A, Wittenbrink C, Lodha S (1997) Approaches to uncertainty visualization. Vis Comput 13(8): 370–390
Peng W, Ward M, Rundensteiner E (2004) Clutter reduction in multi-dimensional data visualization using dimension reordering. In: Proceedings of the IEEE symposium on information visualization, pp 89–96
Rao R, Card S (1994) The table lens: merging graphical and symbolic representations in an interactive focus+context visualization for tabular information. In: Proceedings of ACM SIGCHI conference on human factors in computing systems, pp 318–322
Rosenholtz R, Li Y, Mansfield J, Jin Z (2005) Feature congestion: a measure of display clutter. In: CHI ’05: Proceedings of the SIGCHI conference on human factors in computing systems, pp 761–770
Sanyal J, Zhang S, Bhattacharya G, Amburn P, Moorhead R (2009) A user study to compare four uncertainty visualization methods for 1D and 2D datasets. IEEE Trans Vis Comput Graph 15(6): 1209–1218
Swayne D, Buja A (1998) Missing data in interactive high-dimensional data visualization. Comput Stat 13(1): 15–26
Tekušová TT, Knuth M, Schreck T, Kohlhammer J (2008) Data quality visualization for multivariate hierarchic data. In: InfoVis demo. http://www.gris.informatik.tu-darmstadt.de/~tschreck/papers/infovis08-poster.pdf
Unwin A, Hawkins G, Hofmann H, Siegl B (1996) Interactive graphics for data sets with missing values—MANET. J Comput Graph Stat 5(2): 113–122
Wang C, Ma K (2008) A statistical approach to volume data quality assessment. IEEE Trans Vis Comput Graph 14(3): 590–602
Ward M (1994) Xmdvtool: Integrating multiple methods for visualizing multivariate data. In: Proceedings of the IEEE visualization, pp 326–333
Wilkinson L, Anand A, Grossman RL (2005) Graph-theoretic scagnostics. In: Proceedings of the IEEE symposium on information visualization, pp 157–164
Wittenbrink C, Pang A, Lodha S (1996) Glyphs for visualizing uncertainty in vector fields. IEEE Trans Vis Comput Graph 2(3): 266–279
Xie Z, Huang S, Ward MO, Rundensteiner EA (2006) Exploratory visualization of multivariate data with variable quality. In: Proceedings of the IEEE symposium on visual analytics science and technology, pp 183–190
Xie Z, Ward MO, Rundensteiner EA, Huang S (2007) Integrating data and quality space interactions in exploratory visualizations. In: Proceedings of the 5th international conference on coordinated & multiple views in exploratory visualization, pp 47–60
Yang J, Ward M, Rundensteiner E (2002) InterRing: an interactive tool for visually navigating and manipulating hierarchical structures. In: Proceedings of the IEEE symposium on information visualization, pp 77–84
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported under NSF grants IIS-0119276 and IIS-0414380.
Rights and permissions
About this article
Cite this article
Ward, M., Xie, Z., Yang, D. et al. Quality-aware visual data analysis. Comput Stat 26, 567–584 (2011). https://doi.org/10.1007/s00180-010-0226-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-010-0226-0