skip to main content
10.1145/3404835.3463092acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Communication Efficient Distributed Hypergraph Clustering

Authors Info & Claims
Published:11 July 2021Publication History

ABSTRACT

Hypergraphs can capture higher-order relations between subsets of objects instead of only pairwise relations as in graphs. Hypergraph clustering is an important task in information retrieval and machine learning. We study the problem of distributed hypergraph clustering in the message passing communication model using small communication cost. We propose an algorithm framework for distributed hypergraph clustering based on spectral hypergraph sparsification. For an n-vertex hypergraph G with hyperedges of maximum size r distributed at s sites arbitrarily and a parameter ε∈ (0,1), our algorithm can produce a vertex set with conductance O(√1+ε/1-ε √φG), where φG is the conductance of G, using communication cost ~O(nr2s/εO(1)) (~O hides a polylogarithmic factor). The theoretical results are complemented with extensive experiments to demonstrate the efficiency and effectiveness of the proposed algorithm under different real-world datasets. Our source code is publicly available at github.com/chunjiangzhu/dhgc.

Skip Supplemental Material Section

Supplemental Material

SIGIR2021-sp1666.mp4

mp4

12.4 MB

References

  1. I. Abraham, D. Durfee, I. Koutis, S. Krinninger, and R. Peng. 2016. On fully dynamic graph sparsifiers. In Proceedings of IEEE Symposium on Foundations of Computer Science. 335--344.Google ScholarGoogle Scholar
  2. S. Agarwal, J. Lim, L. Zelnik-Manor, P. Perona, D. Kriegman, and S. Belongie. 2005. Beyond pairwise clustering. In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. 838--845.Google ScholarGoogle Scholar
  3. M.-F. Balcan, S. Ehrlich, and Y. Liang. 2013. Distributed k-means and k-median clustering on general communication topologies. In Proceedings of Neural Information Processing Systems Conference. 1995--2003.Google ScholarGoogle Scholar
  4. M.-F. Balcan, V. Kanchanapally, Y. Liang, and D. P. Woodruff. 2014. Improved distributed principal component analysis. In Proceedings of Neural Information Processing Systems Conference. 3113--3121.Google ScholarGoogle Scholar
  5. N. Bansal, O. Svensson, and L. Trevisan. 2019. New notions and constructions of sparsification for graphs and hypergraphs. In Proceedings of IEEE Symposium on Foundations of Computer Science. 910--928.Google ScholarGoogle Scholar
  6. A. R. Benson, D. F. Gleich, and J. Leskovec. 2016. Higher-order organization of complex networks. Science, Vol. 353, 6295 (2016), 163--166.Google ScholarGoogle Scholar
  7. S. R. Bulò and M. Pelillo. 2013. A game-theoretic approach to hypergraph clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, 6 (2013), 1312--1327.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. T.-H. H. Chan, A. Louis, Z. G. Tang, and C. Zhang. 2018. Spectral properties of hypergraph Laplacian and approximation algorithms. J. ACM, Vol. 65, 3 (2018), 1--48.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Chen, H. Sun, D.P. Woodruff, and Q. Zhang. 2016. Communication-optimal distributed clustering. In Proceedings of Neural Information Processing Systems Conference. 3720--3728.Google ScholarGoogle Scholar
  10. Y. Chen, S. Khanna, and A. Nagda. 2020. Near-linear Size Hypergraph Cut Sparsifiers. In Proceedings of IEEE Symposium on Foundations of Computer Science. to appear.Google ScholarGoogle Scholar
  11. I. Chien, C.-Y. Lin, and I.-H. Wang. 2018. Community detection in hypergraphs: Optimal statistical limit and efficient algorithms. In Proceedings of International Conference on Artificial Intelligence and Statistics. 871--879.Google ScholarGoogle Scholar
  12. D. Ghoshdastidar and A. Dukkipati. 2014. Consistency of spectral partitioning of uniform hyper- graphs under planted partition model. In Proceedings of Neural Information Processing Systems Conference. 397--405.Google ScholarGoogle Scholar
  13. M. Hein, S. Setzer, L. Jost, and S. S. Rangapuram. 2013. The total variation on hypergraphs- learning on hypergraphs revisited. In Proceedings of Neural Information Processing Systems Conference. 2427--2435.Google ScholarGoogle Scholar
  14. M. Ikeda, A. Miyauchi, Y. Takai, and Y. Yoshida. 2018. Finding Cheeger cuts in hypergraphs via heat equation. arXiv preprint arXiv:1809.04396 (2018).Google ScholarGoogle Scholar
  15. M. Kapralov, R. Krauthgamer, J. Tardos, and Y. Yoshida. 2021. Towards Tight Bounds for Spectral Sparsification of Hypergraphs. In Proceedings of ACM Symposium on Theory of Computing .Google ScholarGoogle Scholar
  16. S. Kim, S. Nowozin, P. Kohli, and C. D. Yoo. 2011. Higher-order correlation clustering for image segmentation. In Proceedings of Neural Information Processing Systems Conference. 1530--1538.Google ScholarGoogle Scholar
  17. Y.T. Lee and H. Sun. 2017. An SDP-based algorithm for linear-sized spectral sparsification. In Proceedings of ACM Symposium on Theory of Computing. 678--687.Google ScholarGoogle Scholar
  18. M. Leordeanu and C. Sminchisescu. 2012. Efficient hypergraph clustering. In Proceedings of International Conference on Artificial Intelligence and Statistics. 676--684.Google ScholarGoogle Scholar
  19. P. Li, H. Dau, G. Puleo, and O. Milenkovic. 2017. Motif clustering and overlapping clustering for social network analysis. In Proceedings of IEEE International Conference on Computer Communications. 109--117.Google ScholarGoogle Scholar
  20. P. Li and O. Milenkovic. 2017. Inhomogeneous hypergraph clustering with applications. In Proceedings of Neural Information Processing Systems Conference. 2305--2315.Google ScholarGoogle Scholar
  21. T. Soma and Y. Yoshida. 2019. Spectral sparsification of hypergraphs. In Proceedings of ACM-SIAM Symposium on Discrete Algorithms. 2570--2581.Google ScholarGoogle Scholar
  22. D.A. Spielman and Shang-Hua Teng. 2011. Spectral sparsification of graphs. SIAM J. Comput., Vol. 40, 4 (2011), 981--1025.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H. Sun and L. Zanetti. 2019. Distributed graph clustering and sparsification. ACM Transactions on Parallel Computing, Vol. 6, 3 (2019), 17.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Y. Takai, A. Miyauchi, M. Ikeda, and Y. Yoshida. 2020. Hypergraph clustering based on PageRank. In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1970--1978.Google ScholarGoogle Scholar
  25. D. P. Woodruff and Q. Zhang. 2017. When distributed computation is communication expensive. Distributed Computing, Vol. 30, 5 (2017), 309--323.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. H. Yin, A. R. Benson, J. Leskovec, and D. F. Gleich. 2017. Local higher-order graph clustering. In Proceedings of ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 555--564.Google ScholarGoogle Scholar
  27. Y. Yoshida. 2019. Cheeger inequalities for submodular transformations. In Proceedings of ACM-SIAM Symposium on Discrete Algorithms. 2582--2601.Google ScholarGoogle ScholarCross RefCross Ref
  28. C. Zhang, S. Hu, Z. G. Tang, and T-H. H. Chan. 2020. Re-revisiting learning on hypergraphs: Confidence interval, subgradient method, and extension to multiclass. IEEE Transactions on Knowledge and Data Engineering, Vol. 32, 3 (2020), 506--518.Google ScholarGoogle ScholarCross RefCross Ref
  29. D. Zhou, J. Huang, and B. Scholkopf. 2007. Learning with hypergraphs: Clustering, classification, and embedding. In Proceedings of Neural Information Processing Systems Conference. 1601--1608.Google ScholarGoogle Scholar
  30. C.J. Zhu, S. Storandt, K.-Y. Lam, S. Han, and J. Bi. 2019 a. Improved dynamic graph learning through fault-tolerant sparsification. In Proceedings of International Conference on Machine Learning. 7624--7633.Google ScholarGoogle Scholar
  31. C.J. Zhu, T. Zhu, K.-Y. Lam, S. Han, and J. Bi. 2019 b. Communication-optimal distributed dynamic graph clustering. In Proceedings of AAAI Conference on Artificial Intelligence. 5957--5964.Google ScholarGoogle Scholar
  32. J.Y. Zien, M.D.F. Schlag, and P.K. Chan. 1999. Multilevel spectral hypergraph partitioning with arbitrary vertex sizes. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, 9 (1999), 1389--1399.Google ScholarGoogle Scholar

Index Terms

  1. Communication Efficient Distributed Hypergraph Clustering

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
          July 2021
          2998 pages
          ISBN:9781450380379
          DOI:10.1145/3404835

          Copyright © 2021 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 11 July 2021

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper

          Acceptance Rates

          Overall Acceptance Rate792of3,983submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader