Programming by Optimisation Meets Parameterised Algorithmics: A Case Study for Cluster Editing

  • Conference paper
Learning and Intelligent Optimization (LION 2015)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8994)

Abstract

Inspired by methods and theoretical results from parameterised algorithmics, we improve the state of the art in solving Cluster Editing, a prominent NP-hard clustering problem with applications in computational biology and beyond. In particular, we demonstrate that an extension of a certain preprocessing algorithm, called the \((k+1)\)-data reduction rule in parameterised algorithmics, embedded in a sophisticated branch-&-bound algorithm, improves over the performance of existing algorithms based on Integer Linear Programming (ILP) and branch-&-bound. Furthermore, our version of the \((k+1)\)-rule outperforms the theoretically most effective preprocessing algorithm, which yields a 2k-vertex kernel. Notably, this 2k-vertex kernel is analysed empirically for the first time here. Our new algorithm was developed by integrating Programming by Optimisation into the classical algorithm engineering cycle – an approach which we expect to be successful in many other contexts.
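For orientation, the classical \((k+1)\)-rule can be sketched as follows. This is the textbook form of the rule as commonly stated in the parameterised-algorithmics literature, not the extended version developed in the paper; the function name `k_plus_one_rule` and the adjacency-dictionary representation are illustrative assumptions. With a budget of \(k\) edge edits, two vertices with more than \(k\) common neighbours must end up in the same cluster, and two vertices with more than \(k\) non-common neighbours must end up in different clusters.

```python
from itertools import combinations


def k_plus_one_rule(adj, k):
    """Classify vertex pairs with the classical (k+1)-rule (illustrative sketch).

    adj: dict mapping each vertex to the set of its neighbours
         (undirected simple graph, no self-loops).
    k:   number of edge insertions/deletions still allowed.

    Returns (permanent, forbidden), two sets of vertex pairs, or None if
    some pair would have to be both permanent and forbidden, in which case
    no solution with at most k edits exists.
    """
    permanent, forbidden = set(), set()
    for u, v in combinations(sorted(adj), 2):
        # Vertices adjacent to both u and v.
        common = adj[u] & adj[v]
        # Vertices adjacent to exactly one of u and v (excluding u, v themselves).
        non_common = (adj[u] ^ adj[v]) - {u, v}
        if len(common) > k and len(non_common) > k:
            return None
        if len(common) > k:
            # Separating u and v would cost more than k edits.
            permanent.add((u, v))
        elif len(non_common) > k:
            # Keeping u and v together would cost more than k edits.
            forbidden.add((u, v))
    return permanent, forbidden


# Hypothetical example: a 4-clique with the edge {0, 1} missing, budget k = 1.
graph = {0: {2, 3}, 1: {2, 3}, 2: {0, 1, 3}, 3: {0, 1, 2}}
result = k_plus_one_rule(graph, 1)
```

In this example the pair (0, 1) has two common neighbours, so the rule marks it as permanent, which corresponds to inserting the missing edge. A full solver would also apply the implied edits, decrease \(k\), and repeat until no pair changes; that loop is omitted here.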

Sepp Hartung—Major parts of this work were done during a research visit of SH at the University of British Columbia in Vancouver (Canada), supported by a “DFG Forschungsstipendium” (HA 7296/1-1).

Notes

  1. Notably, our implementation is still able to solve \(M\)-Tree Clustering. However, here our focus is on improving over state-of-the-art exact solvers for Cluster Editing.

  2. http://www.mi.fu-berlin.de/w/LiSA/YoshikoCharles

  3. We removed the largest instance with 8836 vertices from the dataset. It is more than two times larger than the second largest instance and could not be solved.

  4. http://fpt.akt.tu-berlin.de/cluEdit/

  5. PAR-10 is the average with timeouts counted as 10 times the cut-off time.
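As a minimal sketch of the PAR-10 score defined in note 5: the function below averages running times and replaces every timed-out run by 10 times the cut-off. The function name `par10`, the use of `None` to mark a timeout, and the choice to treat a run that reaches the cut-off as a timeout are assumptions made for illustration, not details taken from the paper.

```python
def par10(runtimes, cutoff):
    """Penalised average running time (PAR-10), illustrative sketch.

    runtimes: list of running times in seconds; None marks a timed-out run.
    cutoff:   cut-off time in seconds.
    """
    if not runtimes:
        raise ValueError("need at least one run")
    penalised = [
        # Timeouts (None, or a time at or beyond the cut-off) count as 10 * cutoff.
        10 * cutoff if t is None or t >= cutoff else t
        for t in runtimes
    ]
    return sum(penalised) / len(penalised)


# Hypothetical data: two solved runs and one timeout with a 10-second cut-off.
score = par10([1.0, 2.0, None], cutoff=10.0)
```

Here the timeout contributes 100 seconds, so the score is (1 + 2 + 100) / 3, which is dominated by the single unsolved instance. That sensitivity to timeouts is the point of the penalty.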

References

  1. Gurobi 5.62. Software (2014)

  2. Agarwala, R., Bafna, V., Farach, M., Narayanan, B., Paterson, M., Thorup, M.: On the approximability of numerical taxonomy (fitting distances by tree matrices). SIAM J. Comput. 28(3), 1073–1085 (1999)

  3. Ailon, N., Charikar, M.: Fitting tree metrics: hierarchical clustering and phylogeny. In: Proceedings of the 46th FOCS, pp. 73–82 (2005)

  4. Bansal, N., Blum, A., Chawla, S.: Correlation clustering. Mach. Learn. 56(1–3), 89–113 (2004)

  5. Böcker, S.: A golden ratio parameterized algorithm for cluster editing. J. Discrete Algorithms 16, 79–89 (2012)

  6. Böcker, S., Baumbach, J.: Cluster editing. In: Bonizzoni, P., Brattka, V., Löwe, B. (eds.) CiE 2013. LNCS, vol. 7921, pp. 33–44. Springer, Heidelberg (2013)

  7. Böcker, S., Briesemeister, S., Klau, G.W.: Exact algorithms for cluster editing: evaluation and experiments. Algorithmica 60(2), 316–334 (2011)

  8. Bonchi, F., Gionis, A., Gullo, F., Ukkonen, A.: Chromatic correlation clustering. In: Proceedings of 18th ACM SIGKDD (KDD 2012), pp. 1321–1329. ACM Press (2012)

  9. Cao, Y., Chen, J.: On parameterized and kernelization algorithms for the hierarchical clustering problem. In: Chan, T.-H., Lau, L., Trevisan, L. (eds.) TAMC 2013. LNCS, vol. 7876, pp. 319–330. Springer, Heidelberg (2013)

  10. Charikar, M., Guruswami, V., Wirth, A.: Clustering with qualitative information. J. Comput. Syst. Sci. 71(3), 360–383 (2005)

  11. Chen, J., Meng, J.: A \(2k\) kernel for the cluster editing problem. J. Comput. Syst. Sci. 78(1), 211–220 (2012)

  12. Chierichetti, F., Dalvi, N., Kumar, R.: Correlation clustering in MapReduce. In: Proceedings of 20th ACM SIGKDD (KDD 2014), pp. 641–650. ACM Press (2014)

  13. de Oca, M.A.M., Aydin, D., Stützle, T.: An incremental particle swarm for large-scale continuous optimization problems: an example of tuning-in-the-loop (re)design of optimization algorithms. Soft Comput. 15(11), 2233–2255 (2011)

  14. Downey, R.G., Fellows, M.R.: Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, London (2013)

  15. Fawcett, C., Hoos, H.H.: Analysing differences between algorithm configurations through ablation. In: Proceedings of 10th MIC, pp. 123–132 (2013)

  16. Fellows, M.R., Langston, M.A., Rosamond, F.A., Shaw, P.: Efficient parameterized preprocessing for cluster editing. In: Csuhaj-Varjú, E., Ésik, Z. (eds.) FCT 2007. LNCS, vol. 4639, pp. 312–321. Springer, Heidelberg (2007)

  17. Gramm, J., Guo, J., Hüffner, F., Niedermeier, R.: Graph-modeled data clustering: exact algorithms for clique generation. Theory Comput. Syst. 38(4), 373–392 (2005)

  18. Grötschel, M., Wakabayashi, Y.: A cutting plane algorithm for a clustering problem. Math. Program. 45(1–3), 59–96 (1989)

  19. Guo, J.: A more effective linear kernelization for cluster editing. Theor. Comput. Sci. 410(8–10), 718–726 (2009)

  20. Guo, J., Hartung, S., Komusiewicz, C., Niedermeier, R., Uhlmann, J.: Exact algorithms and experiments for hierarchical tree clustering. In: Proceedings of 24th AAAI. AAAI Press (2010)

  21. Hoos, H.H.: Programming by optimization. Commun. ACM 55(2), 70–80 (2012)

  22. Hutter, F., Hoos, H.H., Leyton-Brown, K.: Sequential model-based optimization for general algorithm configuration. In: Coello, C.A.C. (ed.) LION 5 2011. LNCS, vol. 6683, pp. 507–523. Springer, Heidelberg (2011)

  23. Niedermeier, R.: Invitation to Fixed-Parameter Algorithms. Oxford University Press, Oxford (2006)

  24. Sanders, P., Wagner, D.: Algorithm engineering. It - Inf. Technol. 53(6), 263–265 (2011)

  25. van Zuylen, A., Williamson, D.P.: Deterministic algorithms for rank aggregation and other ranking and clustering problems. In: Kaklamanis, C., Skutella, M. (eds.) WAOA 2007. LNCS, vol. 4927, pp. 260–273. Springer, Heidelberg (2008)

Acknowledgement

We thank Tomasz Przedmojski who provided, as part of his bachelor thesis, an accelerated implementation of the \(\mathcal {O}(M\cdot k)\) kernel [20].

Author information

Corresponding author

Correspondence to Sepp Hartung.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Hartung, S., Hoos, H.H. (2015). Programming by Optimisation Meets Parameterised Algorithmics: A Case Study for Cluster Editing. In: Dhaenens, C., Jourdan, L., Marmion, ME. (eds) Learning and Intelligent Optimization. LION 2015. Lecture Notes in Computer Science, vol 8994. Springer, Cham. https://doi.org/10.1007/978-3-319-19084-6_5

  • DOI: https://doi.org/10.1007/978-3-319-19084-6_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19083-9

  • Online ISBN: 978-3-319-19084-6

  • eBook Packages: Computer Science, Computer Science (R0)
