Abstract
We propose an interactive analytical system for exploring and interpreting non-linear projections. Although non-linear projections are widely used in disclosing complex structures in high-dimensional analysis, there is a strong need to interpret them due to their inherent complexity and weak interpretability. In the machine learning and visualization communities, it is inspiring to use local linear models to fit and interpret non-linear models. Regarding non-linear projections, there are research gaps in both generation and exploration of linear fitting segments. To fill this gap, we propose an optimization algorithm to partition a non-linear projection into linear segments according to the feature of local affine transformations. We then construct a hierarchy of linear segments and conjunct hierarchical visualizations to support a coarse-to-fine exploration. After that, we design and implement a visual interface that integrates the proposed algorithms and a suite of visual tools. Three case studies demonstrate that the proposed approach facilitates the interpretation of non-linear projections.
Graphical abstract
Similar content being viewed by others
References
Balasubramanian M, Schwartz EL (2002) The isomap algorithm and topological stability. Science 295(5552):7–7
Bibal A, Vu VM, Nanfack G, Frénay B (2020) Explaining t-SNE embeddings locally by adapting lime. In: ESANN, pp 393–398
Bishop CM (2006) Pattern recognition and machine learning. springer
Boggust A, Carter B, Satyanarayan A (2022) Embedding comparator: Visualizing differences in global structure and local neighborhoods via small multiples. In: 27th international conference on intelligent user interfaces, pp 746–766
Cavallo M, Demiralp Ç (2018) A visual interaction framework for dimensionality reduction based data exploration. In: Proceedings of the 2018 CHI conference on human factors in computing systems, pp 1–13
Chatzimparmpas A, Martins RM, Kerren A (2020) t-visne: interactive assessment and interpretation of t-SNE projections. IEEE Trans Vis Comput Graph 26(8):2696–2714. https://doi.org/10.1109/TVCG.2020.2986996
Chen X, Zeng W, Lin Y, AI-Maneea HM, Roberts J, Chang R (2021) Composition and configuration patterns in multiple-view visualizations. IEEE Trans Vis Comput Graph 27(2):1514–1524. https://doi.org/10.1109/TVCG.2020.3030338
Cowan N (2001) The magical number 4 in short-term memory: a reconsideration of mental storage capacity. Behav Brain Sci 24(1):87–114. https://doi.org/10.1017/S0140525X01003922
Dua D, Graff C (2017) UCI machine learning repository
Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugen 7(2):179–188
Gower JC, Dijksterhuis GB et al (2004) Procrustes problems, vol. 30. Oxford University Press on Demand
Joia P, Coimbra D, Cuminato JA, Paulovich FV, Nonato LG (2011) Local affine multidimensional projection. IEEE Trans Vis Comput Graph 17(12):2563–2571
Kambhatla N, Leen TK (1997) Dimension reduction by local principal component analysis. Neural Comput 9(7):1493–1516
Kim T-K, Kittler J (2005) Locally linear discriminant analysis for multimodally distributed classes for face recognition with a single model image. IEEE Trans Patt Anal Mach Intell 27(3):318–327
Lensen A, Xue B, Zhang M (2020) Genetic programming for evolving a front of interpretable models for data visualization. IEEE Trans Cybern 51(11):5468–5482
Liu M, Shi J, Li Z, Li C, Zhu J, Liu S (2017) Towards better analysis of deep convolutional neural networks. IEEE Trans Vis Comput Graph 23(1):91–100
Liu S, Maljovec D, Wang B, Bremer P, Pascucci V (2017) Visualizing high-dimensional data: advances in the past decade. IEEE Trans Vis Comput Graph 23(3):1249–1268
Liu S, Wang X, Liu M, Zhu J (2017) Towards better analysis of machine learning models: a visual analytics perspective. Vis Inform 1(1):48–56. https://doi.org/10.1016/j.visinf.2017.01.006
Liu Y, Jun E, Li Q, Heer J (2019) Latent space cartography: visual analysis of vector space embeddings. In: Computer graphics forum, vol. 38, Wiley Online Library, pp 67–78
Ma Y, Maciejewski R (2020) Visual analysis of class separations with locally linear segments. IEEE Trans Vis Comput Graph. https://doi.org/10.1109/TVCG.2020.3011155
Maaten Lvd, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9(Nov):2579–2605
Marion R, Bibal A, Frénay B (2019) Bir: a method for selecting the best interpretable multidimensional scaling rotation using external variables. Neurocomputing 342(C):83–96. https://doi.org/10.1016/j.neucom.2018.11.093
McInnes L, Healy J, Melville J (2018) Umap: Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426,
Miller GA (1956) The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychol Rev 63(2):81
Möller R, Hoffmann H (2004) An extension of neural gas to local PCA. Neurocomputing 62:305–326
Nonato LG, Aupetit M (2019) Multidimensional projection for visual analytics: linking techniques with distortions, tasks, and layout enrichment. IEEE Trans Vis Comput Graph 25(8):2650–2673. https://doi.org/10.1109/TVCG.2018.2846735
Ribeiro MT, Singh S, Guestrin C (2016) “why should i trust you?” explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144
Soltanolkotabi M, Elhamifar E, Candes EJ (2014) Robust subspace clustering. Ann Stat 42(2):669–699
Stahnke J, Dörk M, Müller B, Thom A (2015) Probing projections: interaction techniques for interpreting arrangements and errors of dimensionality reductions. IEEE Trans Vis Comput Graph 22(1):629–638
Tenenbaum JB, Silva Vd, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. science 290(5500):2319–2323
Wang J, Saligrama V (2012) Local supervised learning through space partitioning. Adv Neural Inf Process Syst 25:91–99
Wang J, Saligrama V (2013) Locally-linear learning machines (l3m). In: Asian conference on machine learning, PMLR, pp 451–466
Wattenberg M, Viégas F, Johnson I (2016) How to use t-SNE effectively. Distill. https://doi.org/10.23915/distill.00002
Xia J, Ye F, Chen W, Wang Y, Chen W, Ma Y, Tung AKH (2018) Ldsscanner: exploratory analysis of low-dimensional structures in high-dimensional datasets. IEEE Trans Vis Comput Graph 24(1):236–245. https://doi.org/10.1109/TVCG.2017.2744098
Xia J, Zhang Y, Song J, Chen Y, Wang Y, Liu S (2021) Revisiting dimensionality reduction techniques for visual cluster analysis: an empirical study. IEEE Trans Vis Comput Graph 28(1):529–539
Yang W, Wang X, Lu J, Dou W, Liu S (2020) Interactive steering of hierarchical clustering. IEEE Trans Vis Comput Graph 27(10):3953–3967
Yuan J, Chen C, Yang W, Liu M, Xia J, Liu S (2021) A survey of visual analytics techniques for machine learning. Comput Vis Media 7(1):3–36. https://doi.org/10.1007/s41095-020-0191-7
Zeng W, Lin C, Lin J, Jiang J, Xia J, Turkay C, Chen W (2020) Revisiting the modifiable areal unit problem in deep traffic prediction with visual analytics. IEEE Trans Vis Comput Graph 27(2):839–848
Zhao Y, Shi J, Liu J, Zhao J, Zhou F, Zhang W, Chen K, Zhao X, Zhu C, Chen W (2021) Evaluating effects of background stories on graph perception. IEEE Trans Vis Comput Graph
Zhou Z, Shi C, Shen X, Cai L, Wang H, Liu Y, Zhao Y, Chen W (2020) Context-aware sampling of large networks via graph representation learning. IEEE Trans Vis Comput Graph 27(2):1709–1719
Acknowledgements
We would like to thank the helpful comments from the anonymous reviewers. This work is supported by the National Natural Science Foundation of China (61872389, 62077039).
Author information
Authors and Affiliations
Corresponding authors
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zeng, X., Zhou, H., Li, Z. et al. iHELP: interactive hierarchical linear projections for interpreting non-linear projections. J Vis 26, 631–648 (2023). https://doi.org/10.1007/s12650-022-00900-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12650-022-00900-4