Abstract
Research data obtained during economics or human studies experiments often displays a complex distribution. Even in the two-dimensional case, the statistical identification of subgroups in research data poses an analytical challenge. Here we introduce an interactive R-based tool called “AdaptGauss2D”. It enables a valid identification of a meaningful multimodal structure in two-dimensional data. With a human-in-the-loop approach, a Gaussian mixture model (GMM) can be fitted to the data. The interactive interface allows a supervised selection of the number and parameters of the GMM based on various visualizations. Integrating a Human-in-the-loop into the process of modeling two-dimensional gaussian mixtures enables the expectation-maximization (EM) algorithm to adapt to more complex GMM compared to the standard non-interactive approach. The work demonstrates that the interactive modeling process for GMM improves the quality of the model in contrast to non-interactive modeling. The improvement is shown using the datasets of EngyTime and a large flow cytometry dataset. The R package “AdaptGauss2D” is available on GitHub https://github.com/Mthrun/AdaptGauss2D.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baggenstoss, P.M.: Statistical modeling using Gaussian mixtures and HMMS with matlab. Naval Undersea Warfare Center, Newport RI (2002)
Yoshida, E., Kimura, Y., Kitamura, K., Murayama, H.: Calibration procedure for a DOI detector of high resolution PET through a Gaussian mixture model. IEEE Trans. Nucl. Sci. 51(5), 2543–2549 (2004)
Yu, J.: Bearing performance degradation assessment using locality preserving projections and Gaussian mixture models. Mech. Syst. Signal Process. 25(7), 2573–2588 (2011)
Wang, et al.: Efficient volume exploration using the Gaussian mixture model. IEEE Trans. Vis. Comput. Graph. 17(11), 1560–1573 (2011)
Yang, M.-S., Lai, C.-Y., Lin, C.-Y.: A robust EM clustering algorithm for Gaussian mixture models. Pattern Recogn. 45(11), 3950–3961 (2012)
Ultsch, et al.: Identification of molecular fingerprints in human heat pain thresholds by use of an interactive mixture model R toolbox (AdaptGauss). Int. J. Mol. Sci. 16(10), 25897–25911 (2015). https://doi.org/10.3390/ijms161025897
Thrun, M.C., Stier, Q.: Fundamental clustering algorithms suite SoftwareX 13(C), 100642 (2021). https://doi.org/10.1016/j.softx.2020.100642
Thrun, M.C., Ultsch, A.: Clustering benchmark datasets exploiting the fundamental clustering problems. Data Brief 30(C), 105501 (2020). https://doi.org/10.1016/j.dib.2020.105501
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Thrun, M.C., Stier, Q., Ultsch, A. (2023). Interactive Toolbox for Two-Dimensional Gaussian Mixture Modeling. In: Amini, MR., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., Tsoumakas, G. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13718. Springer, Cham. https://doi.org/10.1007/978-3-031-26422-1_51
Download citation
DOI: https://doi.org/10.1007/978-3-031-26422-1_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26421-4
Online ISBN: 978-3-031-26422-1
eBook Packages: Computer ScienceComputer Science (R0)