Abstract
With the rapid growth of data in size and complexity, that are available on shared cloud computing platform, the threat of malicious activities and computer crimes has increased accordingly. Thus, investigating efficient data visualization techniques for visual analytics of such big data and visual intrusion detection over data intensive cloud computing is urgently required. In this paper, we first propose a new parallel coordinates visualization method that uses arced-axis for high-dimensional data representation. This new geometrical scheme can be efficiently used to identify the main features of network attacks by displaying recognizable visual patterns. In addition, with the aim of visualizing the clear and detailed structure of the dataset according to the contribution of each attribute, we propose a meaningful layout for the new method based on singular value decomposition algorithm, which possesses statistical property and can overcome the curse of dimensionality. Finally, we design a prototype system for network scan detection, which is based on our visualization approach. The experiments have shown that our approach is effective in visualizing multivariate datasets and detecting attacks from a variety of networking patterns, such as the features of DDoS attacks.
Similar content being viewed by others
References
Zhang X, Liu C, Nepal S, Pandey S, Chen J (2013) A privacy leakage upper-bound constraint based approach for cost-effective privacy preserving of intermediate datasets in cloud. IEEE Trans Parallel Distrib Syst 24(6):1192–1202
Zhang X, Yang LT, Liu C, Chen J (2014) A scalable two-phase top-down specialization approach for data anonymization using mapreduce on cloud. IEEE Trans Parallel Distrib Syst 25(2):363–373
Zhang X, Liu C, Nepal S, Chen J (2013) An efficient quasi-identifier index based approach for privacy preservation over incremental data sets on cloud. J Comput Syst Sci 79(5):542–555
Claessen JH, van Wijk JJ (2011) Flexible linked axes for multivariate data visualization. IEEE Trans Vis Comput Graph 17(12):2310–2316
Inselberg A (1985) The plane with parallel coordinates. Vis Comput 1(2):69–91
Wegman E (1990) Hyperdimensional data analysis using parallel coordinates. J Am Stat Assoc 85(411):664–675
Dasgupta A, Kosara R (2010) Pargnostics: screen-space metrics for parallel coordinates. IEEE Trans Vis Comput Graph 16(6):1017–1026
Huh M-H, Park DY (2008) Enhancing parallel coordinate plots. J Korean Stat Soc 37(2):129–133
Zhou H, Yuan X, Qu H, Cui W, Chen B (2008) Visual clustering in parallel coordinates. Comput Graph Forum 27(3):1047–1054
Zhou H, Cui W, Qu H, Wu Y, Yuan X, Zhuo W (2009) Splatting the lines in parallel coordinates. Comput Graph Forum 28(3):759–766
Dang TN, Wilkinson L (2010) A stacking graphic elements to avoid over-plotting. IEEE Trans Vis Comput Graph 16(6):1044–1052
Artero AO, de Oliveira MCF, Levkowitz H (2004) Uncovering clusters in crowded parallel coordinates visualizations. In: Proceedings of IEEE symposium on information visualization, INFOVIS 2004, pp 81–88
Yuan X, Guo P, Xiao H, Zhou H, Qu H (2009) Scattering points in parallel coordinates. IEEE Trans Vis Comput Graph 15(6):1001–1008
Peng W, Ward MO, Rundensteiner EA (2004) Clutter reduction in multi-dimensional data visualization using dimension reordering. In: Proceedings of IEEE symposium on information visualization, INFOVIS 2004, pp 89–96
Artero AO, de Oliveira MCF, Levkowitz H (2006) Enhanced high dimensional data visualization through dimension reduction and attribute arrangement. In: Proceedings of The tenth international conference on information visualization, IV 2006, pp 707–712
Ferdosi BJ, Roerdink JB (2011) Visualizing high-dimensional structures by dimension ordering and filtering using subspace analysis. Comput Graph Forum 30(3):1121–1130
Claessen JH, Van Wijk JJ (2011) Flexible linked axes for multivariate data visualization. IEEE Trans Vis Comput Graph 17(12):2310–2316
Tominski C, Abello J, Schumann H (2004 ) Axes-based visualizations with radial layouts. In: Proceedings of the ACM symposium on applied, computing, pp 1242–1247
Hauser H, Ledermann F, Doleisch H (2002) Angular brushing of extended parallel coordinates. In: Proceedings of IEEE symposium on information visualization, INFOVIS 2002, pp 127–130
Dimsdale B (1984) Conic transformations and projectivities. Technical Report, 6320-2753
Golub G, Van Loan C (1996) Matrix computations, vol 3. Johns Hopkins University Press, Baltimore
Simek K (2003) Properties of singular value decomposition based dynamical model of gene expression data. Int J Appl Math Comput Sci 13(3):337–346
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Huang, M.L., Lu, L.F. & Zhang, X. Using arced axes in parallel coordinates geometry for high dimensional BigData visual analytics in cloud computing. Computing 97, 425–437 (2015). https://doi.org/10.1007/s00607-014-0383-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-014-0383-z
Keywords
- Multivariate data visualization
- High-dimensional data visualization
- Parallel coordinate geometry
- Arced-axis
- Network security
- Network intrusion detection