Skip to main content

Local Search Approximation Algorithms for the Spherical k-Means Problem

  • Conference paper
  • First Online:
Algorithmic Aspects in Information and Management (AAIM 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11640))

Included in the following conference series:

Abstract

In this paper, we study the spherical k-means problem (SKMP) which is one of the most well-studied clustering problems. In the SKMP, we are given an n-client set \( \mathcal {D}\) in d-dimensional unit sphere \(\mathbb {S}^d\), and an integer \(k \le n\). The goal is to open a center subset \(F \subset \mathbb {S}^d\) with \( |F| \le k\) that minimizes the sum of cosine dissimilarity measure for each client in \( \mathcal {D} \) to the nearest open center. We give a \((2 (4+\sqrt{7}) + \varepsilon )\)-approximation algorithm for this problem using local search scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ahmadian, S., Norouzi-Fard, A., Svensson, O., Ward, J.: Better guarantees for \(k\)-means and Euclidean \(k\)-median by primal-dual algorithms. In: Proceedings of FOCS, pp. 61–72 (2017)

    Google Scholar 

  2. Aloise, D., Deshpande, A., Hansen, P., Popat, P.: NP-hardness of Euclidean sum-of-squares clustering. Mach. Learn. 75, 245–249 (2009)

    Article  Google Scholar 

  3. Arthur, D., Vassilvitskii, S.: \(k\)-means++: the advantages of careful seeding. In: Proceedings of SODA, pp. 1027–1035 (2007)

    Google Scholar 

  4. Arya, V., Garg, N., Khandekar, R., Meyerson, A., Munagala, K., Pandit, V.: Local search heuristics for \(k\)-median and facility location problems. SIAM J. Comput. 33, 544–562 (2004)

    Article  MathSciNet  Google Scholar 

  5. Byrka, J., Pensyl, T., Rybicki, B., Srinivasan, A., Trinh, K.: An improved approximation for \(k\)-median, and positive correlation in budgeted optimization. ACM Trans. Algorithms 13(2) (2017). Article No. 23

    Google Scholar 

  6. Charikar, M., Guha, S.: Improved combinatorial algorithms for the facility location and \(k\)-median problems. In: Proceedings of FOCS, pp. 378–388 (1999)

    Google Scholar 

  7. Charikar, M., Guha, S., Tardos, \(\acute{\rm E}\)., Shmoys, D.B.: A constant-factor approximation algorithm for the \(k\)-median problem. J. Comput. Syst. Sci. 65, 129–149 (2002)

    Google Scholar 

  8. Cohen-Addad, V., Mathieu, C.: Effectiveness of local search for geometric optimization. In: Proceedings of SoCG, pp. 329–343 (2015)

    Google Scholar 

  9. Cohen-Addad, V., Klein, P.N., Mathieu, C.: Local search yields approximation schemes for \(k\)-means and \(k\)-median in Euclidean and minor-free metrics. In: Proceedings of FOCS, pp. 353–364 (2016)

    Google Scholar 

  10. Cohen-Addad, V., Schwiegelshohn, C.: On the local structure of stable clustering instances. In: Proceedings of FOCS, pp. 49–60 (2017)

    Google Scholar 

  11. Dasgupta, S.: The hardness of \(k\)-means clustering. Technical report CS2007-0890, University of California, San Diego (2007)

    Google Scholar 

  12. Dhillon, I.S., Modha, D.S.: Concept decompositions for large sparse text data using clustering. Mach. Learn. 42, 143–175 (2001)

    Article  Google Scholar 

  13. Endo, Y., Miyamoto, S.: Spherical \(k\)-means\(++\) clustering. In: Proceedings of MDAI, pp. 103–114 (2015)

    Google Scholar 

  14. Friggstad, Z., Rezapour, M., Salavatipour, M.R.: Local search yields a PTAS for \(k\)-means in doubling metrics. In: Proceedings of FOCS, pp. 365–374 (2016)

    Google Scholar 

  15. Friggstad, Z., Zhang, Y.: Tight analysis of a multiple-swap heurstic for budgeted red-blue median. In: Proceedings of ICALP, pp. 75:1–75:13 (2016)

    Google Scholar 

  16. Hornik, K., Feinerer, I., Kober, M., Buchta, C.: Spherical \(k\)-means clustering. J. Stat. Softw. 50(10), 1–22 (2012)

    Article  Google Scholar 

  17. Jain, K., Vazirani, V.V.: Approximation algorithms for metric facility location and \(k\)-median problems using the primal-dual schema and Lagrangian relaxation. J. ACM 48, 274–296 (2001)

    Article  MathSciNet  Google Scholar 

  18. Kanungo, T., Mount, D.M., Netanyahu, N.S., Piatko, C.D., Silverman, R., Wu, A.Y.: A local search approximation algorithm for \(k\)-means clustering. Comput. Geom. Theory Appl. 28, 89–112 (2004)

    Article  MathSciNet  Google Scholar 

  19. Li, M., Xu, D., Zhang, D., Zou, J.: The seeding algorithms for spherical \(k\)-means clustering. J. Glob. Optim. 1–14 (2019). https://doi.org/10.1007/s10898-019-00779-w

  20. Li, S., Svensson, O.: Approximating \(k\)-median via pseudo-approximation. SIAM J. Comput. 45, 530–547 (2016)

    Article  MathSciNet  Google Scholar 

  21. Lloyd, S.: Least squares quantization in PCM. Technical report, Bell Laboratories (1957)

    Google Scholar 

  22. Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28, 129–137 (1982)

    Article  MathSciNet  Google Scholar 

  23. Mahajan, M., Nimbhorkar, P., Varadarajan K.: The planar \(k\)-means problem is NP-hard. In: Proceedings of WALCOM, pp. 274–285 (2009)

    Google Scholar 

  24. Makarychev, K., Makarychev, Y., Sviridenko, M., Ward, J.: A bi-criteria approximation algorithm for \(k\)-means. In: Proceedings of APPROX/RONDOM, pp. 14:1–14:20 (2016). Article No. 14

    Google Scholar 

  25. Matou\(\check{\rm s}\)ek, J.: On approximate geometric \(k\)-clustering. Discrete Comput. Geom. 24, 61–84 (2000)

    Google Scholar 

  26. Zhang, P.: A new approximation algorithm for the \(k\)-facility location problem. Theor. Comput. Sci. 384, 126–135 (2007)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

The first author is supported by Natural Science Foundation of China (No. 11871081). The second author is supported by Natural Science Foundation of China (No. 11871366). The third author is the Higher Educational Science and Technology Program of Shandong Province (No. J17KA171). The fourth author is supported by Natural Science Foundation of China (No. 61433012), Shenzhen Research Grant (KQJSCX2018033017 0311901, JCYJ20180305180840138 and GFW2017073114031767), and Hong Kong GRF 17210017. The fifth author is supported by Natural Science Foundation of China (No. 11531014).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yukun Cheng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, D., Cheng, Y., Li, M., Wang, Y., Xu, D. (2019). Local Search Approximation Algorithms for the Spherical k-Means Problem. In: Du, DZ., Li, L., Sun, X., Zhang, J. (eds) Algorithmic Aspects in Information and Management. AAIM 2019. Lecture Notes in Computer Science(), vol 11640. Springer, Cham. https://doi.org/10.1007/978-3-030-27195-4_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-27195-4_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-27194-7

  • Online ISBN: 978-3-030-27195-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics