Skip to main content

Learning a metric when clustering data points in the presence of constraints

  • Regular Article
  • Published:
Advances in Data Analysis and Classification Aims and scope Submit manuscript

Abstract

Learning an appropriate distance measure under supervision of side information has become a topic of significant interest within machine learning community. In this paper, we address the problem of metric learning for constrained clustering by considering three important issues: (1) considering importance degree for constraints, (2) preserving the topological structure of data, and (3) preserving some natural distribution properties in the data. This work provides a unified way to handle different issues in constrained clustering by learning an appropriate distance measure. It has modeled the first issue by injecting the importance degree of constraints directly into an objective function. The topological structure of data is preserved by minimizing the reconstruction error of data in the target space. Finally we addressed the issue of preserving natural distribution properties in the data by using the proximity information of data. We have proposed two different methods to address the above mentioned issues. The first approach learns a linear transformation of data into a target space (linear-model) and the second one uses kernel functions to learn an appropriate distance measure (non-linear-model). Experiments show that considering these issues significantly improves clustering accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Notes

  1. http://archive.ics.uci.edu/ml/.

References

  • Abin AA (2016a) Clustering with side information: further efforts to improve efficiency. Pattern Recognit Lett 84:252–258

    Article  Google Scholar 

  • Abin AA (2016b) Querying beneficial constraints before clustering using facility location analysis. IEEE Trans Cybern 99:1–12

    Google Scholar 

  • Abin AA, Beigy H (2014) Active selection of clustering constraints: a sequential approach. Pattern Recognit 47(3):1443–1458

    Article  Google Scholar 

  • Abin AA, Beigy H (2015) Active constrained fuzzy clustering: a multiple kernels learning approach. Pattern Recognit 48(3):953–967

    Article  MATH  Google Scholar 

  • Arenas A, Diaz-Guilera A, Kurths J, Moreno Y, Zhou C (2008) Synchronization in complex networks. Phys Rep 469(3):93–153

    Article  MathSciNet  Google Scholar 

  • Baghshah MS, Afsari F, Shouraki SB, Eslami E (2014) Scalable semi-supervised clustering by spectral kernel learning. Pattern Recognit Lett 45:161–171

    Article  Google Scholar 

  • Bar-Hillel A, Hertz T, Shental N, Weinshall D (2005) Learning a Mahalanobis metric from equivalence constraints. J Mach Learn Res 6:937–965

    MathSciNet  MATH  Google Scholar 

  • Basu S, Banerjee A, Mooney RJ (2004) Active semi-supervision for pairwise constrained clustering. In: Proceedings of the 5th SIAM international conference on data mining, ICDM ’04, pp 333–344

  • Basu S, Davidson I, Wagstaff KL (2008) Constrained clustering: advances in algorithms, theory, and applications. Chapman and Hall/CRC, Boca Raton

    MATH  Google Scholar 

  • Bilenko M, Basu S, Mooney RJ (2004a) Integrating constraints and metric learning in semi-supervised clustering. In: Proceedings of the 21th international conference on Machine learning, ICML ’04, pp 11–18

  • Bilenko M, Basu S, Mooney RJ (2004b) Integrating constraints and metric learning in semi-supervised clustering. In: Proceedings of the twenty-first international conference on Machine learning, ACM, p 11

  • Chang H, yan Yeung D (2004) Locally linear metric adaptation for semi-supervised clustering. In: Proceedings of the 21th international conference on machine learning, ICML ’04, pp 153–160

  • Cheng H, Hua KA, Vu K (2008) Constrained locally weighted clustering. Proc VLDB Endow 1(1):90–101

    Article  Google Scholar 

  • Davidson I, Wagstaff KL, Basu S (2006) Measuring constraint-set utility for partitional clustering algorithms. In: Proceedings of the 10th European conference on Principle and practice of knowledge discovery in databases, PKDD ’06, pp 115–126

    Google Scholar 

  • Ding S, Jia H, Zhang L, Jin F (2014) Research of semi-supervised spectral clustering algorithm based on pairwise constraints. Neural Comput Appl 24(1):211–219

    Article  Google Scholar 

  • Gong C, Fu K, Wu Q, Tu E, Yang J (2014) Semi-supervised classification with pairwise constraints. Neurocomputing 139:130–137

    Article  Google Scholar 

  • He P, Xu X, Hu K, Chen L (2014) Semi-supervised clustering via multi-level random walk. Pattern Recognit 47(2):820–832

    Article  MATH  Google Scholar 

  • Hertz T, Bar-hillel A, Weinshall D (2004) Boosting margin based distance functions for clustering. In: Proceedings of the 21th international conference on machine learning, ICML ’04, pp 393–400

  • Hoi SCH, Jin R, Lyu MR (2007) Learning non-parametric kernel matrices from pairwise constraints. In: Proceedings of the 24th international conference on Machine learning, ICML ’07, pp 361–368

  • Hubert L, Arabie P (1985) Comparing Partitions. J Classif 2(1):193–218

    Article  MATH  Google Scholar 

  • Jain P, Kulis B, Davis JV, Dhillon IS (2012) Metric and kernel learning using a linear transformation. J Mach Learn Res 13:519–547

    MathSciNet  MATH  Google Scholar 

  • Kalakech M, Biela P, Macaire L, Hamad D (2011) Constraint scores for semi-supervised feature selection: a comparative study. Pattern Recognit Lett 32(5):656–665

    Article  Google Scholar 

  • Khoreva A, Galasso F, Hein M, Schiele B (2014) Learning must-link constraints for video segmentation based on spectral clustering. In: Pattern recognition, Springer, pp 701–712

  • Korinna Bade AN (2014) Hierarchical constraints. Mach Learn 94(3):371–399

    Article  MathSciNet  MATH  Google Scholar 

  • Kulis B, Basu S, Dhillon I, Mooney R (2009) Semi-supervised graph clustering: a kernel approach. Mach Learn 74(1):1–22

    Article  Google Scholar 

  • LeCun Y, Cortes C (2010) MNIST handwritten digit database http://yann.lecun.com/exdb/mnist/

  • Liu H, Wu Z, Li X, Cai D, Huang T (2012) Constrained non-negative matrix factorization for image representation. IEEE Trans Pattern Anal Mach Intell 34(7):1299–1311

    Article  Google Scholar 

  • Melnykov V, Melnykov I, Michael S (2016) Semi-supervised model-based clustering with positive and negative constraints. Adv Data Anal Classif 10(3):327–349

    Article  MathSciNet  MATH  Google Scholar 

  • Milligan G, Cooper M (1986) A study of the comparability of external criteria for hierarchical cluster analysis. Multivariate Behav Res 21(4):441–458

    Article  Google Scholar 

  • Okabe M, Yamada S (2009) Clustering with constrained similarity learning. In: Proceedings of the 2009 IEEE/WIC/ACM international conference on web intelligence and international conference on intelligent agent technology, pp 30–33

  • Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326

    Article  Google Scholar 

  • Sinkkonen J, Kaski S (2002) Clustering based on conditional distributions in an auxiliary space. Neural Comput 14:217–239

    Article  MATH  Google Scholar 

  • Smieja M, Wiercioch M (2017) Constrained clustering with a complex cluster structure. Adv Data Anal Classif 11(3):493–518

    Article  MathSciNet  MATH  Google Scholar 

  • Soleymani Baghshah M, Bagheri Shouraki S (2010) Non-linear metric learning using pairwise similarity and dissimilarity constraints and the geometrical structure of data. Pattern Recognit 43:2982–2992

    Article  MATH  Google Scholar 

  • Truong D, Battiti R (2013) A flexible cluster-oriented alternative clustering algorithm for choosing from the pareto front of solutions. Mach Learn 98(1):57–91

    MathSciNet  MATH  Google Scholar 

  • Vu VV, Labroche N, Bouchon-Meunier B (2012) Improving constrained clustering with active query selection. Pattern Recognit 45(4):1749–1758

    Article  Google Scholar 

  • Wagstaff K, Cardie C (2000) Clustering with instance-level constraints. In: Proceedings of the seventeenth national conference on artificial intelligence and twelfth conference on on innovative applications of artificial intelligence, July 30–August 3, 2000, Austin, Texas, USA, p 1097

  • Wagstaff K, Cardie C, Rogers S, Schrödl S (2001) Constrained k-means clustering with background knowledge. In: Proceedings of the 18th international conference on machine learning, ICML ’01, pp 577–584

  • Wang Q, Yuen PC, Feng G (2013) Semi-supervised metric learning via topology preserving multiple semi-supervised assumptions. Pattern Recognit 46(9):2576–2587

    Article  MATH  Google Scholar 

  • Wu S, Feng X, Zhou W (2014) Spectral clustering of high-dimensional data exploiting sparse representation vectors. Neurocomputing 135:229–239

    Article  Google Scholar 

  • Xiang S, Nie F, Zhang C (2008) Learning a Mahalanobis distance metric for data clustering and classification. Pattern Recognit 41(12):3600–3612

    Article  MATH  Google Scholar 

  • Ye J, Zhao Z, Liu H (2007) Adaptive distance metric learning for clustering. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition (CVPR 2007), pp 1–7

  • Yin X, Chen S, Hu E, Zhang D (2010) Semi-supervised clustering with metric learning: an adaptive kernel method. Pattern Recognit 43(4):1320–1333

    Article  MATH  Google Scholar 

  • Zhang Z, Zhao M, Chow TWS (2012) Marginal semi-supervised sub-manifold projections with informative constraints for dimensionality reduction and recognition. Neural Netw 36:97–111

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ahmad Ali Abin.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Abin, A.A., Bashiri, M.A. & Beigy, H. Learning a metric when clustering data points in the presence of constraints. Adv Data Anal Classif 14, 29–56 (2020). https://doi.org/10.1007/s11634-019-00359-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11634-019-00359-6

Keywords

Mathematics Subject Classification