Skip to main content
Log in

On visualizing large multidimensional datasets with a multi-threaded radial approach

  • Published:
Distributed and Parallel Databases Aims and scope Submit manuscript

Abstract

In this paper, we study how to visualize large amounts of multidimensional data with a radial visualization. For such a visualization, we study a multi-threaded implementation on the CPU and the GPU. We start by reviewing the approaches that have visualized the largest multidimensional datasets and we focus on the approaches that have used CPU or GPU parallelization. We consider the radial visualizations and we describe our approach (called POIViz) that uses points of interest to determine a layout of a large dataset. We detail its parallelization on the CPU and the GPU. We study the efficiency of this approach with different configurations and for large datasets. We show that it can visualize, in less than one second, millions of data with tens of dimensions, and that it can support “real-time” interactions even for large datasets. We conclude on the advantages and limits of the proposed visualization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. http://lib.stat.cmu.edu/multi/pca.c.

References

  1. Andrecut, M.: Parallel gpu implementation of iterative pca algorithms. J. Comput. Biol. 16(11), 1593–1599 (2009)

    Article  MathSciNet  Google Scholar 

  2. Ankerst, M., Keim, D.A., Kriegel, H.P.: Circle segments: a technique for visually exploring large multidimensional data sets. In: Proceedings of the IEEE Visualization’96, Hot Topics 96 (1996)

  3. Bache, K., Lichman, M.: UCI machine learning repository (2013). URL http://archive.ics.uci.edu/ml

  4. Carr, D.B., Littlefield, R.J., Nicholson, W.L., Littlefield, J.S.: Scatterplot matrix techniques for large n. J. Am. Stat. Assoc. 82(398), 424–436 (1987)

    MathSciNet  Google Scholar 

  5. Da Costa, D., Venturini, G.: An interactive visualization environment for data exploration using points of interest. In: Li, X., Zaïane, O.R., Li, Z. (eds.) ADMA, Lecture Notes in Computer Science, pp. 416–423. Springer, Xi’an (2006)

    Google Scholar 

  6. Daniels, K., Grinstein, G., Russell, A., Glidden, M.: Properties of normalized radial visualizations. Inf. Vis. 11(4), 273–300 (2012)

    Article  Google Scholar 

  7. Fester, T., Schreiber, F., Strickert, M.: Cuda-based multi-core implementation of mds-based bioinformatics algorithms. In: Grosse, I., Neumann, S., Posch, S., Schreiber, F., Stadler, P. (eds.) German Conference on Bioinformatics 2009, 28th to 30th September 2009. Lecture Notes in Informatics, vol. 157, pp. 67–79. GI-Edition, Martin Luther University Halle-Wittenberg, Germany (2009)

  8. Florek, M.: Using modern hardware for interactive information visualization of large data. Master’s thesis, Comenius University, Bratislava (2006)

  9. Frishman, Y., Tal, A.: Multi-level graph layout on the gpu. IEEE Trans. Vis. Comput. Graph. 13(6), 1310–1319 (2007)

    Article  Google Scholar 

  10. Fua, Y.H., Ward, M.O., Rundensteiner, E.A.: Hierarchical parallel coordinates for exploration of large datasets. In: Proceedings of the Conference on Visualization ’99: Celebrating Ten Years, VIS ’99, pp. 43–50. IEEE Computer Society Press (1999)

  11. Gregg, C., Hazelwood, K.: Where is the data? why you cannot debate cpu vs. gpu performance without the answer. In: IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2011, pp. 134–144 (2011)

  12. Hoffman, P., Grinstein, G.G., Pinkney, D.: Dimensional anchors: a graphic primitive for multidimensional multivariate information visualizations. In: Workshop on New Paradigms in Information Visualization and Manipulation’99, pp. 9–16 (1999)

  13. Ingram, S., Munzner, T., Olano, M.: Glimmer: multilevel MDS on the GPU. IEEE Trans. Vis. Comput. Graph. 15, 249–261 (2009)

    Article  Google Scholar 

  14. Jian, L., Wang, C., Liu, Y., Liang, S., Yi, W., Shi, Y.: Parallel data mining techniques on graphics processing unit with compute unified device architecture (CUDA). J. Supercomput. 64, 942–967 (2013)

  15. Keim, D.A.: Information visualization and visual data mining. IEEE Trans. Vis. Comput. Graph. 8(1), 1–8 (2002)

    Article  MathSciNet  Google Scholar 

  16. Keim, D.A., Ankerst, M., Kriegel, H.P.: Recursive pattern: a technique for visualizing very large amounts of data. In: Proceedings of the 6th conference on Visualization’95 pp. 279–286 (1995)

  17. Liu, T., Bouali, F., Venturini, G.: Visualisation radiale: approche parallèle entre cpu et gpu. Revue des Nouvelles Technologies de l’Information, Extraction et Gestion des Connaissances, RNTI-E-24, 175–180 (2013)

  18. Liu, Z., Jiang, B., Heer, J.: immens: Real-time visual querying of big data. Computer Graphics Forum (Proceedings of EuroVis) (2013)

  19. McConnell, S., Sturgeon, R., Henry, G., Mayne, A., Hurley, R.: Scalability of self-organizing maps on a gpu cluster using opencl and cuda. J. Phys. 341(1), 012018 (2012)

    Google Scholar 

  20. McCormick, W.T., Schweitzer, P.J., White, T.W.: Problem decomposition and data reorganization by a clustering technique. Oper. Res. 20(5), 993–1009 (1972)

    Article  MATH  Google Scholar 

  21. McDonnel, B., Elmqvist, N.: Towards utilizing gpus in information visualization: A model and implementation of image-space operations. IEEE Trans. Vis. Comput. Graph. 15(6), 1105–1112 (2009)

    Article  Google Scholar 

  22. Oesterling, P., Heine, C., Janicke, H., Scheuermann, G., Heyer, G.: Visualization of high-dimensional point clouds using their density distribution’s topology. IEEE Trans. Vis. Comput. Graph. 17(11), 1547–1559 (2011)

    Article  Google Scholar 

  23. Owens, J.D., Houston, M., Luebke, D., Green, S., Stone, J.E., Phillips, J.C.: Gpu computing. Proc. IEEE 96(5), 879–899 (2008)

    Article  Google Scholar 

  24. Reina, G., Ertl, T.: Implementing fastmap on the gpu: considerations on general-purpose computation on graphics hardware. Theory Pract. Comput. Graph. 5, 51–58 (2005)

    Google Scholar 

  25. Russell, A., Daniels, K., Grinstein, G.: Voronoi diagram based dimensional anchor assessment for radial visualizations. In: 16th International Conference on Information Visualisation (IV), 2012, pp. 229–233 (2012)

  26. Ryoo, S., Rodrigues, C.I., Baghsorkhi, S.S., Stone, S.S., Kirk, D.B., Hwu, W.M.W.: Optimization principles and application performance evaluation of a multithreaded gpu using cuda. In: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, pp. 73–82. ACM (2008)

  27. Shneiderman, B.: The eyes have it: a task by data type taxonomy for information visualizations. In: Proceedings of the 1996 IEEE Symposium on Visual Languages, VL ’96, pp. 336–343. IEEE Computer Society (1996)

  28. Varoneckas, A., Žilinskas, A., Žilinskas, J.: Parallel multidimensional scaling using grid computing: assessment of performance. Inf. Technol. Control 37(1), 52–56 (2008)

    MATH  Google Scholar 

  29. Venturini, G., Costa, D.D.: Une méthode de classification visuelle et interactive. Revue des Nouvelles Technologies de l’Information, Classification: points de vue croiss, RNTI-C-2, 17–29 (2008)

  30. Wegman, E., Luo, Q.: Visualizing densities. Technical report // George Mason University, Center for Computational Statistics (1995)

  31. Wegman, E.J.: Huge data sets and the frontiers of computational feasibility. J. Comput. Graph. Stat. 4, 281–295 (1995)

    MathSciNet  Google Scholar 

  32. Weiskopf, D.: GPU-Based Interactive Visualization Techniques (Mathematics and Visualization). Springer, New York (2006)

    MATH  Google Scholar 

  33. Wong, P.C., Bergeron, R.D.: Scientific Visualization—Overviews. Methodologies and Techniques. 30 years of multidimensional multivariate visualization. IEEE Computer Society Press, Los Alamitos (1997)

    Google Scholar 

  34. Yi, J.S., Melton, R., Stasko, J., Jacko, J.A.: Dust & magnet: multivariate information visualization using a magnet metaphor. Inf. Vis. 4(4), 239–256 (2005)

    Google Scholar 

  35. Zhongwen, L., Hongzhi, L., Zhengping, Y., Xincai, W.: Self-organizing maps computing on graphic process unit. In: 13th European Symposium on Artificial Neural Networks (ESANN), pp. 557–562. d-side (2005)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gilles Venturini.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, T., Bouali, F. & Venturini, G. On visualizing large multidimensional datasets with a multi-threaded radial approach. Distrib Parallel Databases 34, 321–345 (2016). https://doi.org/10.1007/s10619-015-7174-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10619-015-7174-1

Keywords

Navigation