Abstract
In this paper, we study how to visualize large amounts of multidimensional data with a radial visualization. For such a visualization, we study a multi-threaded implementation on the CPU and the GPU. We start by reviewing the approaches that have visualized the largest multidimensional datasets and we focus on the approaches that have used CPU or GPU parallelization. We consider the radial visualizations and we describe our approach (called POIViz) that uses points of interest to determine a layout of a large dataset. We detail its parallelization on the CPU and the GPU. We study the efficiency of this approach with different configurations and for large datasets. We show that it can visualize, in less than one second, millions of data with tens of dimensions, and that it can support “real-time” interactions even for large datasets. We conclude on the advantages and limits of the proposed visualization.
Similar content being viewed by others
Notes
http://lib.stat.cmu.edu/multi/pca.c.
References
Andrecut, M.: Parallel gpu implementation of iterative pca algorithms. J. Comput. Biol. 16(11), 1593–1599 (2009)
Ankerst, M., Keim, D.A., Kriegel, H.P.: Circle segments: a technique for visually exploring large multidimensional data sets. In: Proceedings of the IEEE Visualization’96, Hot Topics 96 (1996)
Bache, K., Lichman, M.: UCI machine learning repository (2013). URL http://archive.ics.uci.edu/ml
Carr, D.B., Littlefield, R.J., Nicholson, W.L., Littlefield, J.S.: Scatterplot matrix techniques for large n. J. Am. Stat. Assoc. 82(398), 424–436 (1987)
Da Costa, D., Venturini, G.: An interactive visualization environment for data exploration using points of interest. In: Li, X., Zaïane, O.R., Li, Z. (eds.) ADMA, Lecture Notes in Computer Science, pp. 416–423. Springer, Xi’an (2006)
Daniels, K., Grinstein, G., Russell, A., Glidden, M.: Properties of normalized radial visualizations. Inf. Vis. 11(4), 273–300 (2012)
Fester, T., Schreiber, F., Strickert, M.: Cuda-based multi-core implementation of mds-based bioinformatics algorithms. In: Grosse, I., Neumann, S., Posch, S., Schreiber, F., Stadler, P. (eds.) German Conference on Bioinformatics 2009, 28th to 30th September 2009. Lecture Notes in Informatics, vol. 157, pp. 67–79. GI-Edition, Martin Luther University Halle-Wittenberg, Germany (2009)
Florek, M.: Using modern hardware for interactive information visualization of large data. Master’s thesis, Comenius University, Bratislava (2006)
Frishman, Y., Tal, A.: Multi-level graph layout on the gpu. IEEE Trans. Vis. Comput. Graph. 13(6), 1310–1319 (2007)
Fua, Y.H., Ward, M.O., Rundensteiner, E.A.: Hierarchical parallel coordinates for exploration of large datasets. In: Proceedings of the Conference on Visualization ’99: Celebrating Ten Years, VIS ’99, pp. 43–50. IEEE Computer Society Press (1999)
Gregg, C., Hazelwood, K.: Where is the data? why you cannot debate cpu vs. gpu performance without the answer. In: IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2011, pp. 134–144 (2011)
Hoffman, P., Grinstein, G.G., Pinkney, D.: Dimensional anchors: a graphic primitive for multidimensional multivariate information visualizations. In: Workshop on New Paradigms in Information Visualization and Manipulation’99, pp. 9–16 (1999)
Ingram, S., Munzner, T., Olano, M.: Glimmer: multilevel MDS on the GPU. IEEE Trans. Vis. Comput. Graph. 15, 249–261 (2009)
Jian, L., Wang, C., Liu, Y., Liang, S., Yi, W., Shi, Y.: Parallel data mining techniques on graphics processing unit with compute unified device architecture (CUDA). J. Supercomput. 64, 942–967 (2013)
Keim, D.A.: Information visualization and visual data mining. IEEE Trans. Vis. Comput. Graph. 8(1), 1–8 (2002)
Keim, D.A., Ankerst, M., Kriegel, H.P.: Recursive pattern: a technique for visualizing very large amounts of data. In: Proceedings of the 6th conference on Visualization’95 pp. 279–286 (1995)
Liu, T., Bouali, F., Venturini, G.: Visualisation radiale: approche parallèle entre cpu et gpu. Revue des Nouvelles Technologies de l’Information, Extraction et Gestion des Connaissances, RNTI-E-24, 175–180 (2013)
Liu, Z., Jiang, B., Heer, J.: immens: Real-time visual querying of big data. Computer Graphics Forum (Proceedings of EuroVis) (2013)
McConnell, S., Sturgeon, R., Henry, G., Mayne, A., Hurley, R.: Scalability of self-organizing maps on a gpu cluster using opencl and cuda. J. Phys. 341(1), 012018 (2012)
McCormick, W.T., Schweitzer, P.J., White, T.W.: Problem decomposition and data reorganization by a clustering technique. Oper. Res. 20(5), 993–1009 (1972)
McDonnel, B., Elmqvist, N.: Towards utilizing gpus in information visualization: A model and implementation of image-space operations. IEEE Trans. Vis. Comput. Graph. 15(6), 1105–1112 (2009)
Oesterling, P., Heine, C., Janicke, H., Scheuermann, G., Heyer, G.: Visualization of high-dimensional point clouds using their density distribution’s topology. IEEE Trans. Vis. Comput. Graph. 17(11), 1547–1559 (2011)
Owens, J.D., Houston, M., Luebke, D., Green, S., Stone, J.E., Phillips, J.C.: Gpu computing. Proc. IEEE 96(5), 879–899 (2008)
Reina, G., Ertl, T.: Implementing fastmap on the gpu: considerations on general-purpose computation on graphics hardware. Theory Pract. Comput. Graph. 5, 51–58 (2005)
Russell, A., Daniels, K., Grinstein, G.: Voronoi diagram based dimensional anchor assessment for radial visualizations. In: 16th International Conference on Information Visualisation (IV), 2012, pp. 229–233 (2012)
Ryoo, S., Rodrigues, C.I., Baghsorkhi, S.S., Stone, S.S., Kirk, D.B., Hwu, W.M.W.: Optimization principles and application performance evaluation of a multithreaded gpu using cuda. In: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pp. 73–82. ACM (2008)
Shneiderman, B.: The eyes have it: a task by data type taxonomy for information visualizations. In: Proceedings of the 1996 IEEE Symposium on Visual Languages, VL ’96, pp. 336–343. IEEE Computer Society (1996)
Varoneckas, A., Žilinskas, A., Žilinskas, J.: Parallel multidimensional scaling using grid computing: assessment of performance. Inf. Technol. Control 37(1), 52–56 (2008)
Venturini, G., Costa, D.D.: Une méthode de classification visuelle et interactive. Revue des Nouvelles Technologies de l’Information, Classification: points de vue croiss, RNTI-C-2, 17–29 (2008)
Wegman, E., Luo, Q.: Visualizing densities. Technical report // George Mason University, Center for Computational Statistics (1995)
Wegman, E.J.: Huge data sets and the frontiers of computational feasibility. J. Comput. Graph. Stat. 4, 281–295 (1995)
Weiskopf, D.: GPU-Based Interactive Visualization Techniques (Mathematics and Visualization). Springer, New York (2006)
Wong, P.C., Bergeron, R.D.: Scientific Visualization—Overviews. Methodologies and Techniques. 30 years of multidimensional multivariate visualization. IEEE Computer Society Press, Los Alamitos (1997)
Yi, J.S., Melton, R., Stasko, J., Jacko, J.A.: Dust & magnet: multivariate information visualization using a magnet metaphor. Inf. Vis. 4(4), 239–256 (2005)
Zhongwen, L., Hongzhi, L., Zhengping, Y., Xincai, W.: Self-organizing maps computing on graphic process unit. In: 13th European Symposium on Artificial Neural Networks (ESANN), pp. 557–562. d-side (2005)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, T., Bouali, F. & Venturini, G. On visualizing large multidimensional datasets with a multi-threaded radial approach. Distrib Parallel Databases 34, 321–345 (2016). https://doi.org/10.1007/s10619-015-7174-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10619-015-7174-1