Abstract
Streaming data can be found in many different scenarios, in which data are generated and arriving continuously. Sampling approaches have been proven as an effective means to cope with the sheer volume of the streaming data. However, sampling methods also introduce uncertainty, which can affect the reliability of subsequent analysis and visualization. In this paper, we propose a novel model called PDm and visualization named uncertainty tree to present uncertainty that arises from sampling streaming data. PDm is first introduced to characterize uncertainty of streaming data, and an optimization method is then proposed to minimize uncertainty. Uncertainty tree is further developed to enhance data understanding by visualizing uncertainty and revealing temporal patterns of streaming data. Lastly, a quantitative evaluation and real-world examples have been conducted to demonstrate the effectiveness and efficacy of the proposed techniques.
Graphical abstract
Similar content being viewed by others
References
Cao N, Lin YR, Gotz D (2016) Untangle map: visual analysis of probabilistic multi-label data. IEEE Trans Vis Comput Graph 22(2):1149–1163
Chen H, Zhang S, Chen W, Mei H, Zhang J, Mercer A, Liang R, Qu H (2015) Uncertainty-aware multidimensional ensemble data visualization and exploration. IEEE Trans Vis Comput Graph 21(9):1072–1086
Correll M, Heer J (2016) Surprise! bayesian weighting for de-biasing thematic maps. IEEE Trans Vis Comput Graph 23(1):651–660
Crouser RJ, Franklin L, Endert A, Cook K (2017) Toward theoretical techniques for measuring the use of human effort in visual analytic systems. IEEE Trans Vis Comput Graph 23(1):121–130
Cui W, Liu S, Wu Z, Wei H (2014) How hierarchical topics evolve in large text corpora. IEEE Trans Vis Comput Graph 20(12):2281–2290
Efraimidis PS, Spirakis PG (2006) Weighted random sampling with a reservoir. Inf Process Lett 97(5):181–185
Feng D, Kwock L, Lee Y, Taylor R (2010) Matching visual saliency to confidence in plots of uncertain data. IEEE Trans Vis Comput Graph 16(6):980–989
Freedman D, Robert P, Purves R (2007) Chance Errors in Sampling. In: Statistics, 4th edn. pp 355–374
Gosink L, Bensema K, Pulsipher T, Obermaier H, Henry M, Childs H, Joy KI, Owhadi H, Scovel C, Sullivan T (2013) Characterizing and visualizing predictive uncertainty in numerical ensembles through bayesian model averaging. IEEE Trans Vis Comput Graph 19(12):2703–2712
Huron S, Vuillemot R, Fekete JD (2013) Visual sedimentation. IEEE Trans Vis Comput Graph 19(12):2446–2455
ISO (2008) Evaluation of measurement data—guide to the expression of uncertainty in measurement
Kim A, Blais E, Parameswaran A, Indyk P, Madden S, Rubinfeld R (2015) Rapid sampling for visualizations with ordering guarantees. Proc VLDB Endow 8(5):521–532
Liu M, Liu S, Zhu X, Liao Q, Wei F, Pan S (2016a) An uncertainty-aware approach for exploratory microblog retrieval. IEEE Trans Vis Comput Graph 22(1):250–259
Liu S, Yin J, Wang X, Cui W, Cao K, Pei J (2016b) Online visual analytics of text streams. IEEE Trans Vis Comput Graph 22(11):2451–2466
MacEachren AM (1992) Visualizing uncertain information. Cartogr Perspect 13:10–19
Mirzargar M, Whitaker RT, Kirby RM (2014) Curve boxplot: generalization of boxplot for ensembles of curves. IEEE Trans Vis Comput Graph 20(12):2654–2663
Pak CW, Foote H, Adams D, Cowley W, Thomas J (2003) Dynamic visualization of transient data streams. In: Proceedings of the IEEE symposium on information visualization, pp 97–104
Pang AT, Wittenbrink CM, Lodha SK (1997) Approaches to uncertainty visualization. Vis Comput 13(8):370–390
Park Y, Cafarella M, Mozafari B (2016) Visualization-aware sampling for very large databases. In: 2016 IEEE 32nd international conference on data engineering (ICDE). IEEE, pp 755–766
Potter K, Kniss J, Riesenfeld R, Johnson C (2010) Visualizing summary statistics and uncertainty. Comput Graph Forum 29(3):823–832
Potter K, Rosen P, Johnson CR (2012) From quantification to visualization: a taxonomy of uncertainty visualization approaches. IFIP Adv Inf Commun Technol 377:226–249
Schulz C, Nocaj A, Goertler J, Deussen O, Brandes U, Weiskopf D (2017) Probabilistic graph layout for uncertain network visualization. IEEE Trans Vis Comput Graph 23(1):531–540
Skeels M, Lee B, Smith G, Robertson G (2008) Revealing uncertainty for information visualization. In: Proceedings of the working conference on advanced visual interfaces, pp 376–379
Tanahashi Y, Hsueh CH, Ma KL (2015) An efficient framework for generating storyline visualizations from streaming data. IEEE Trans Vis Comput Graph 21(6):730–742
Thomson J, Hetzler E, MacEachren A, Gahegan M, Pavel M (2005) A typology for visualizing uncertainty. Proc SPIE Vis Data Anal 5669:146
Vitter JS (1985) Random sampling with a reservoir. ACM Trans Math Softw 11(1):37–57
Wan M, Chen X, Kaplan L, Han J, Gao J, Zhao B (2016) From truth discovery to trustworthy opinion discovery: an uncertainty-aware quantitative modeling approach. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp 1885–1894
Whitaker RT, Mirzargar M, Kirby RM (2013) Contour boxplots: a method for characterizing uncertainty in feature sets from simulation ensembles. IEEE Trans Vis Comput Graph 19(12):2713–2722
Wu Y, Wei F, Liu S, Au N, Cui W, Zhou H, Qu H (2010) OpinionSeer: interactive visualization of hotel customer feedback. IEEE Trans Vis Comput Graph 16(6):1109–1118
Wu Y, Yuan GX, Ma KL (2012) Visualizing flow of uncertainty through analytical processes. IEEE Trans Vis Comput Graph 18(12):2526–2535
Wu Y, Cao N, Gotz D, Tan YP, Keim DA (2016) A survey on visual analytics of social media data. IEEE Trans Multimed 18(11):2135–2148
Xu P, Mei H, Ren L, Chen W (2017) ViDX: visual diagnostics of assembly line performance in smart factories. IEEE Trans Vis Comput Graph 23(1):291–300
Zuk T, Carpendale S (2006) Theoretical analysis of uncertainty visualizations. In: Proceedings of the SPIE visualization and data analysis, vol 6060, p 606007-14
Zuk T, Carpendale S (2007) Visualization of uncertainty and reasoning. Proc Int Symp Smart Graph 4569:164–177
Acknowledgements
The work was supported by NSFC (61761136020, 61502416), NSFC-Zhejiang Joint Fund for the Integration of Industrialization and Informatization (U1609217), Zhejiang Provincial Natural Science Foundation (LR18F020001) and the 100 Talents Program of Zhejiang University. This project was also partially funded by Microsoft Research Asia.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tang, T., Yuan, K., Tang, J. et al. Toward the better modeling and visualization of uncertainty for streaming data. J Vis 22, 79–93 (2019). https://doi.org/10.1007/s12650-018-0518-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12650-018-0518-y