Skip to main content
Log in

Quality-aware visual data analysis

  • Original Paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

The quality, certainty, or confidence of decisions made during the visual analytics process depends on many factors, including the completeness and reliability of the initial data, information loss due to filtering, sampling, and other transformations, and the accuracy and clarity of the visual presentation. Unfortunately, in most visualization tools the analyst is unaware of these and other forces that degrade the meaningfulness of their results. In this paper, we describe our efforts to design strategies for tackling the measurement, display, and utilization of quality aspects at all stages of the visualization pipeline. The goal is to help analysts maintain an awareness of the accuracy and completeness of the information conveyed in the images, and subsequently the patterns observed and decisions made based on the analysis. Quality measures can be used both to assist analysts in selecting, transforming, and mapping their data as well as to automatically refine processes to generate higher quality views. We have implemented several such techniques within XmdvTool, a public-domain package for visual analytics. We illustrate the quality-specific components of our tool with several case studies to show the usefulness of the approach. We also describe user studies that were performed to validate the accuracy of our quality measures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ahlberg C, Shneiderman B (1994) Visual information seeking using the filmfinder. In: Proceedings of the ACM SIGCHI conference on human factors in computing systems, 2:433

  • Amar R, Stasko J (2004) A knowledge task-based framework for design and evaluation of information visualizations. In: Proceedings of the IEEE symposium on information visualization, pp 143–150

  • Asuncion A, Newman D (2007) UCI machine learning repository. http://www.ics.uci.edu/~mlearn/MLRepository.html

  • Bertini E, Santucci G (2004) Quality metrics for 2d scatterplot graphics: automatically reducing visual clutter. In: Proceedings of 4th international symposium on smartGraphics, pp 77–89

  • Boutin F, Hascoet M (2004) Cluster validity indices for graph partitioning. In: Eighth international conference on information visualisation (IV’04), pp 376–381

  • Carreira-Perpinan M (1997) A review of dimension reduction techniques. Tech. Rep. CS–96–09, Dept. of Computer Science, University of Sheffield, http://faculty.ucmerced.edu/mcarreira-perpinan/papers/cs-96-09.pdf

  • Cui Q, Ward M, Rundensteiner E, Yang J (2006) Measuring data abstraction quality in multiresolution visualizations. IEEE Trans Vis Comput Graph 12(5): 709–716

    Article  Google Scholar 

  • Dix A, Ellis G (2002) By chance—enhancing interaction with large data sets through statistical sampling. In: Proceedings of advanced visual interfaces, pp 167–176

  • Djurcilov S, Kim K, Lermusiaux P, Pang A (2002) Visualizing scalar volumetric data with uncertainty. Comput Graph 26(2): 239–248

    Article  Google Scholar 

  • Duda R, Hart P, Stork D (2001) Pattern classification. 2nd edn. Wiley, London

    MATH  Google Scholar 

  • Friendly M, Kwan E (2003) Effect ordering for data displays. Comput Stat Data Anal 43: 509–539

    Article  MathSciNet  MATH  Google Scholar 

  • Fua Y, Ward M, Rundensteiner E (2000) Structure-based brushes: a mechanism for navigating hierarchically organized data and information spaces. IEEE Trans Vis Comput Graph 6(2): 150–159

    Article  Google Scholar 

  • Haase H (1998) Mirror, mirror on the wall, who has the best visualization of all? a reference model for visualization quality. In: Proceedings of visualization in scientific computing ’98, pp 117–128

  • Hofmann H, Theus M (1998) Selection sequences in MANET. Comput Stat 13(1): 77–88

    Google Scholar 

  • Hunter G (1999) New tools for handling spatial data quality: moving from academic concepts to practical reality. URISA J 11(2): 25–34

    Google Scholar 

  • Luo A, Kao D, Pang A (2003) Visualizing spatial distribution data sets. In: VISSYM ’03: proceedings of the symposium on data visualisation 2003, Eurographics Association, Aire-la-Ville, Switzerland, pp 29–38

  • MacEachren AM (1992) Visualizing uncertain information. Cartogr Perspect 13: 10–19

    Google Scholar 

  • Martin A, Ward M (1995) High dimensional brushing for interactive exploration of multivariate data. In: Proceedings of the IEEE visualization, pp 271–278

  • Olston C, Mackinlay J (2002) Visualizing data with bounded uncertainty. In: Proceedings of the IEEE symposium on information visualization, pp 37–40

  • Pang A (2001) Visualizing uncertainty in geo-spatial data. In: Proceedings of worshop on the intersections between geospatial information and information technology

  • Pang A, Wittenbrink C, Lodha S (1997) Approaches to uncertainty visualization. Vis Comput 13(8): 370–390

    Article  Google Scholar 

  • Peng W, Ward M, Rundensteiner E (2004) Clutter reduction in multi-dimensional data visualization using dimension reordering. In: Proceedings of the IEEE symposium on information visualization, pp 89–96

  • Rao R, Card S (1994) The table lens: merging graphical and symbolic representations in an interactive focus+context visualization for tabular information. In: Proceedings of ACM SIGCHI conference on human factors in computing systems, pp 318–322

  • Rosenholtz R, Li Y, Mansfield J, Jin Z (2005) Feature congestion: a measure of display clutter. In: CHI ’05: Proceedings of the SIGCHI conference on human factors in computing systems, pp 761–770

  • Sanyal J, Zhang S, Bhattacharya G, Amburn P, Moorhead R (2009) A user study to compare four uncertainty visualization methods for 1D and 2D datasets. IEEE Trans Vis Comput Graph 15(6): 1209–1218

    Article  Google Scholar 

  • Swayne D, Buja A (1998) Missing data in interactive high-dimensional data visualization. Comput Stat 13(1): 15–26

    MATH  Google Scholar 

  • Tekušová TT, Knuth M, Schreck T, Kohlhammer J (2008) Data quality visualization for multivariate hierarchic data. In: InfoVis demo. http://www.gris.informatik.tu-darmstadt.de/~tschreck/papers/infovis08-poster.pdf

  • Unwin A, Hawkins G, Hofmann H, Siegl B (1996) Interactive graphics for data sets with missing values—MANET. J Comput Graph Stat 5(2): 113–122

    Article  Google Scholar 

  • Wang C, Ma K (2008) A statistical approach to volume data quality assessment. IEEE Trans Vis Comput Graph 14(3): 590–602

    Article  Google Scholar 

  • Ward M (1994) Xmdvtool: Integrating multiple methods for visualizing multivariate data. In: Proceedings of the IEEE visualization, pp 326–333

  • Wilkinson L, Anand A, Grossman RL (2005) Graph-theoretic scagnostics. In: Proceedings of the IEEE symposium on information visualization, pp 157–164

  • Wittenbrink C, Pang A, Lodha S (1996) Glyphs for visualizing uncertainty in vector fields. IEEE Trans Vis Comput Graph 2(3): 266–279

    Article  Google Scholar 

  • Xie Z, Huang S, Ward MO, Rundensteiner EA (2006) Exploratory visualization of multivariate data with variable quality. In: Proceedings of the IEEE symposium on visual analytics science and technology, pp 183–190

  • Xie Z, Ward MO, Rundensteiner EA, Huang S (2007) Integrating data and quality space interactions in exploratory visualizations. In: Proceedings of the 5th international conference on coordinated & multiple views in exploratory visualization, pp 47–60

  • Yang J, Ward M, Rundensteiner E (2002) InterRing: an interactive tool for visually navigating and manipulating hierarchical structures. In: Proceedings of the IEEE symposium on information visualization, pp 77–84

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Matthew Ward.

Additional information

This work is supported under NSF grants IIS-0119276 and IIS-0414380.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ward, M., Xie, Z., Yang, D. et al. Quality-aware visual data analysis. Comput Stat 26, 567–584 (2011). https://doi.org/10.1007/s00180-010-0226-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-010-0226-0

Keywords

Navigation