Abstract:
Varied edge bundling methods have been used to reduce visual clutter in parallel coordinates plots (PCP). However, existing edge-bundled PCP do not scale well for visual ...Show MoreMetadata
Abstract:
Varied edge bundling methods have been used to reduce visual clutter in parallel coordinates plots (PCP). However, existing edge-bundled PCP do not scale well for visual analysis of multidimensional big data and often overplot the bundles in the area near the axes. In this study, we propose a scalable lightweight bundling method to support visual analysis of multidimensional big data in PCP. It helps the users discover trends and detect outliers in the data by bundling the edges between each two adjacent axes independently. We integrate human judgments into the two-dimensional data binning by novel interactions to accelerate the clustering process of the data. We use the frequency-based representation to render the clusters as histogram-like bundles to reveal the distribution of the data and eliminate the overplotting of the bundles. Based on our method, we build a lightweight web-based visual analytics system for exploring multidimensional big data in PCP. The scalability analysis of our method shows that its clustering time increases linearly with the size of the data. Its rendering time is independent of the size of the data. It can cluster and visualize 1 million data records with 6 dimensions in about 1 second in web-based visualization without pre-computation of the data or hardware-accelerated rendering. We conduct two case studies and a user study to compare our method with classic PCP and two state-of-the-art edge-bundled PCP. The results show that our method is more efficient and effective for visually analyzing multidimensional big data.
Published in: IEEE Transactions on Big Data ( Volume: 9, Issue: 1, 01 February 2023)