Augmenting Automatic Clustering with Expert Knowledge and Explanations

Bobek, Szymon; Nalepa, Grzegorz J.

doi:10.1007/978-3-030-77970-2_48

Augmenting Automatic Clustering with Expert Knowledge and Explanations

Conference paper
First Online: 09 June 2021

1144 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12745))

Abstract

Cluster discovery from highly-dimensional data is a challenging task, that has been studied for years in the fields of data mining and machine learning. Most of them focus on automation of the process, resulting in the clusters that once discovered have to be carefully analyzed to assign semantics for numerical labels. However, it is often the case that such an explicit, symbolic knowledge about possible clusters is available prior to clustering and can be used to enhance the learning process. More importantly, we demonstrate how a machine learning model can be used to refine the expert knowledge and extend it with an aid of explainable AI algorithms. We present our framework on an artificial, reproducible dataset.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
See the project webpage at http://PACMEL.geist.re.

References

Ali, M., Jones, M.W., Xie, X., Williams, M.: TimeCluster: dimension reduction applied to temporal data for visual analytics. Vis. Comput. 35(6–8), 1013–1026 (2019)
Article Google Scholar
Atzmueller, M.: Experience management with task-configurations and task-patterns for descriptive data mining. In: Proceedings of KESE 2007, 30th German Conference on Artificial Intelligence (KI-2007) (2007)
Google Scholar
Atzmueller, M., Seipel, D.: Declarative specification of ontological domain knowledge for descriptive data mining (extended version). In: Proceedings of 18th International Conference on Applications of Declarative Programming and Knowledge Management (2008)
Google Scholar
Coden, A., Danilevsky, M., Gruhl, D., Kato, L., Nagarajan, M.: A method to accelerate human in the loop clustering, pp. 237–245
Google Scholar
Lundberg, S.M., et al.: Explainable AI for trees: from local explanations to global understanding. CoRR. arXiv:1905.04610 (2019)
McInnes, L., Healy, J., Melville, J.: UMAP: uniform manifold approximation and projection for dimension reduction (2020)
Google Scholar
Mujkanovic, F., Doskoč, V., Schirneck, M., Schäfer, P., Friedrich, T.: timeXplain - a framework for explaining the predictions of time series classifiers (2020)
Google Scholar
Pope, P.E., Kolouri, S., Rostami, M., Martin, C.E., Hoffmann, H.: Explainability methods for graph convolutional neural networks. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10764–10773 (2019)
Google Scholar
Ribeiro, M.T., Singh, S., Guestrin, C.: “Why should i trust you?”: explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2016, pp. 1135–1144. Association for Computing Machinery, New York (2016)
Google Scholar
Ribeiro, M.T., Singh, S., Guestrin, C.: Anchors: high-precision model-agnostic explanations. In: AAAI (2018)
Google Scholar
Wenskovitch, J., North, C.: Observation-level interaction with clustering and dimension reduction algorithms. In: HILDA 2017. Association for Computing Machinery, New York (2017)
Google Scholar
Zhang, L., Kalashnikov, D.V., Mehrotra, S.: Context-assisted face clustering framework with human-in-the-loop. Int. J. Multimedia Inf. Retrieval 3(2), 69–88 (2014). https://doi.org/10.1007/s13735-014-0052-1
Article Google Scholar

Download references

Acknowledgements

The paper is funded from the PACMEL project funded by the National Science Centre, Poland under CHIST-ERA programme (NCN 2018/27/Z/ST6/03392). The authors are grateful to ACK Cyfronet, Krakow for granting access to the computing infrastructure built in the projects No. POIG.02.03.00-00-028/08 “PLATON - Science Services Platform” and No. POIG.02.03.00-00-110/13 “Deploying high-availability, critical services in Metropolitan Area Networks (MAN-HA)”.

Author information

Authors and Affiliations

Jagiellonian Human-Centered Artificial Intelligence Laboratory (JAHCAI) and Institute of Applied Computer Science, Jagiellonian University, 31-007, Kraków, Poland
Szymon Bobek & Grzegorz J. Nalepa
AGH University of Science and Technology, Kraków, Poland
Szymon Bobek & Grzegorz J. Nalepa

Authors

Szymon Bobek
View author publications
You can also search for this author in PubMed Google Scholar
Grzegorz J. Nalepa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Szymon Bobek .

Editor information

Editors and Affiliations

AGH University of Science and Technology, Krakow, Poland
Maciej Paszynski
Ludwig-Maximilians-Universität München, Munich, Germany
Dieter Kranzlmüller
University of Amsterdam, Amsterdam, The Netherlands
Valeria V. Krzhizhanovskaya
University of Tennessee at Knoxville, Knoxville, TN, USA
Jack J. Dongarra
University of Amsterdam, Amsterdam, The Netherlands
Peter M.A. Sloot

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bobek, S., Nalepa, G.J. (2021). Augmenting Automatic Clustering with Expert Knowledge and Explanations. In: Paszynski, M., Kranzlmüller, D., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M. (eds) Computational Science – ICCS 2021. ICCS 2021. Lecture Notes in Computer Science(), vol 12745. Springer, Cham. https://doi.org/10.1007/978-3-030-77970-2_48

Download citation

DOI: https://doi.org/10.1007/978-3-030-77970-2_48
Published: 09 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77969-6
Online ISBN: 978-3-030-77970-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics