Skip to main content
Log in

On Methods and Measures for the Inspection of Arbitrarily Oriented Subspace Clusters

  • Schwerpunktbeitrag
  • Published:
Datenbank-Spektrum Aims and scope Submit manuscript

Abstract

When using arbitrarily oriented subspace clustering algorithms one obtains a partitioning of a given data set and for each partition its individual subspace. Since clustering is an unsupervised machine learning task, we may not have “ground truth” labels at our disposal or do not wish to rely on them. What is needed in such cases are internal measure which permits a label-less analysis of the obtained subspace clustering. In this work, we propose methods for revising clusters obtained from arbitrarily oriented correlation clustering algorithms. Initial experiments conducted reveal improvements in the clustering results compared to the original clustering outcome. Our proposed approach is simple and can be applied as a post-processing step on arbitrarily oriented correlation clusterings.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. Not to be confused with linear discriminant analysis.

References

  1. Achtert E, Böhm C, Kriegel HP, Kröger P, Zimek A (2007) Robust, complete, and efficient correlation clustering. In: Proceedings of the 2007 SIAM International Conference on Data Mining, SIAM, pp 413–418

    Google Scholar 

  2. Achtert E, Böhm C, David J, Kröger P, Zimek A (2008) Global correlation clustering based on the hough transform. Stat Anal Data Min 1(3):111–127

    Article  MathSciNet  Google Scholar 

  3. Aggarwal CC, Yu PS (2000) Finding generalized projected clusters in high dimensional spaces. ACM 29:70–81

    Google Scholar 

  4. Böhm C, Kailing K, Kröger P, Zimek A (2004) Computing clusters of correlation connected objects. In: Proceedings of the 2004 ACM SIGMOD international conference on Management of data. ACM, New York, pp 455–466

    Chapter  Google Scholar 

  5. Böhm C, Achtert E, Kröger P, Zimek A, Kriegel H (2007) On exploring complex relationships of correlation clusters. In: 2007 International Conference on Scientific and Statistical Database Management(SSDBM), p 7

    Google Scholar 

  6. Ester M, Kriegel HP, Sander J, Xu X et al (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. KDD 96:226–231

    Google Scholar 

  7. Massey FJ Jr (1951) The kolmogorov-smirnov test for goodness of fit. J Am Stat Assoc 46(253):68–78

    Article  Google Scholar 

  8. Kambhatla N, Leen TK (1993) Fast nonlinear dimension reduction. In: IEEE International Conference on Neural Networks. IEEE, Piscataway Township, pp 1213–1218

    Chapter  Google Scholar 

  9. Lloyd S (1982) Least squares quantization in pcm. IEEE Trans Inf Theory 28(2):129–137

    Article  MathSciNet  Google Scholar 

  10. Moon TK (1996) The expectation-maximization algorithm. IEEE Signal Process Mag 13(6):47–60

    Article  Google Scholar 

  11. Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65

    Article  Google Scholar 

  12. Zimek A, Schubert E, Kriegel HP (2012) A survey on unsupervised outlier detection in high-dimensional numerical data. Statistical Analy Data Mining 5:363–387

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniyal Kazempour.

Additional information

Daniyal Kazempour conceptualized and wrote this work during his time at the Ludwig-Maximilians-University Munich.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kazempour, D., Winter, J., Kröger, P. et al. On Methods and Measures for the Inspection of Arbitrarily Oriented Subspace Clusters. Datenbank Spektrum 21, 213–223 (2021). https://doi.org/10.1007/s13222-021-00388-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13222-021-00388-6

Keywords

Navigation