Skip to main content

Abstract

Clustering has been proven to produce better results when applied to learning problems that fall under semi-supervised paradigms, where only incomplete or partial information about the dataset is available to perform clustering. Classic constrained clustering and recent monotonic clustering problems belong to the semi-supervised learning paradigm, although a combination of both has never been addressed. This study aims to prove that the fusion of the background knowledge leveraged by the two aforementioned semi-supervised clustering techniques results in improved performance. To do so, a hybrid objective function combining them is proposed and optimized by means of an expectation minimization scheme. The capabilities of the proposed method are tested in a wide variety of datasets with incremental levels of background knowledge and compared to purely monotonic clustering and purely constrained clustering methods belonging to the state-of-the-art. Bayesian statistical testing is used to validate the obtained results.

Our work has been supported by the research projects PID2020-119478GB-I00, A-TIC-434-UGR20 and PREDOC_01648.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://sci2s.ugr.es/keel/category.php?cat=clas.

References

  1. Benavoli, A., Corani, G., Demšar, J., Zaffalon, M.: Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis. J. Mach. Learn. Res. 18(1), 2653–2688 (2017)

    MathSciNet  MATH  Google Scholar 

  2. Bradley, P.S., Bennett, K.P., Demiriz, A.: Constrained K-means clustering. Microsoft Research, Redmond, vol. 20 (2000)

    Google Scholar 

  3. Cano, J.R., Gutiérrez, P.A., Krawczyk, B., Woźniak, M., García, S.: Monotonic classification: an overview on algorithms, performance measures and data sets. Neurocomputing 341, 168–182 (2019)

    Article  Google Scholar 

  4. Carrasco, J., García, S., del Mar Rueda, M., Herrera, F.: rNPBST: an R Package covering non-parametric and Bayesian statistical tests. In: Martínez de Pisón, F.J., Urraca, R., Quintián, H., Corchado, E. (eds.) HAIS 2017. LNCS (LNAI), vol. 10334, pp. 281–292. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59650-1_24

    Chapter  Google Scholar 

  5. Carrasco, J., García, S., Rueda, M., Das, S., Herrera, F.: Recent trends in the use of statistical tests for comparing swarm and evolutionary computing algorithms: practical guidelines and a critical review. Swarm Evol. Comput. 54, 100665 (2020)

    Article  Google Scholar 

  6. Davidson, I., Basu, S.: A survey of clustering with instance level constraints. ACM Trans. Knowl. Discov. Data 1, 1–41 (2007)

    Google Scholar 

  7. González, S., Herrera, F., García, S.: Monotonic random forest with an ensemble pruning mechanism based on the degree of monotonicity. N. Gener. Comput. 33(4), 367–388 (2015)

    Article  Google Scholar 

  8. González-Almagro, G., Luengo, J., Cano, J.R., García, S.: DILS: constrained clustering through dual iterative local search. Comput. Oper. Res. 121, 104979 (2020)

    Article  MathSciNet  Google Scholar 

  9. Hubert, L., Arabie, P.: Comparing partitions. J. classif. 2(1), 193–218 (1985)

    Article  Google Scholar 

  10. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)

    Article  Google Scholar 

  11. Law, M.H.C., Topchy, A., Jain, A.K.: Clustering with soft and group constraints. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds.) SSPR /SPR 2004. LNCS, vol. 3138, pp. 662–670. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-27868-9_72

    Chapter  MATH  Google Scholar 

  12. Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28(2), 129–137 (1982)

    Article  MathSciNet  Google Scholar 

  13. Rosenfeld, J., De Smet, Y., Debeir, O., Decaestecker, C.: Assessing partially ordered clustering in a multicriteria comparative context. Pattern Recogn. 114, 107850 (2021)

    Article  Google Scholar 

  14. Schmidt, J., Brandle, E.M., Kramer, S.: Clustering with attribute-level constraints. In: 2011 IEEE 11th International Conference on Data Mining, pp. 1206–1211. IEEE (2011)

    Google Scholar 

  15. Triguero, I., et al.: KEEL 3.0: an open source software for multi-stage analysis in data mining. Int. J. Comput. Intell. Syst. 10(1), 1238–1249 (2017)

    Google Scholar 

  16. Vouros, A., Vasilaki, E.: A semi-supervised sparse K-means algorithm. Pattern Recogn. Lett. 142, 65–71 (2021)

    Article  Google Scholar 

  17. Wagstaff, K., Cardie, C., Rogers, S., Schrödl, S.: Constrained K-means clustering with background knowledge. In: Proceedings of the Eighteenth International Conference on Machine Learning, pp. 577–584. Morgan Kaufmann Publishers Inc. (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Germán González-Almagro .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

González-Almagro, G., Bermejo, P.S., Suarez, J.L., Cano, JR., García, S. (2022). Monotonic Constrained Clustering: A First Approach. In: Fujita, H., Fournier-Viger, P., Ali, M., Wang, Y. (eds) Advances and Trends in Artificial Intelligence. Theory and Practices in Artificial Intelligence. IEA/AIE 2022. Lecture Notes in Computer Science(), vol 13343. Springer, Cham. https://doi.org/10.1007/978-3-031-08530-7_61

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-08530-7_61

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-08529-1

  • Online ISBN: 978-3-031-08530-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics