skip to main content
10.1145/3459637.3482437acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

HyperGraph Convolution Based Attributed HyperGraph Clustering

Authors Info & Claims
Published:30 October 2021Publication History

ABSTRACT

Attributed Graph Clustering (AGC) and Attributed Hypergraph Clustering (AHC) are important topics in graph mining with many applications. For AGC, amongst the unsupervised methods that combine the graph structure with node attributes, graph convolution has been shown to achieve impressive results. However, the effects of graph convolution on AGC have not yet been adequately studied. In this paper, we show that graph convolution attempts to find the best trade-off between node attribute distance and the number of inter-cluster edges. On the one hand, we show that compared to clustering node attributes directly, graph convolution produces a greater distance between node attributes in the same cluster and a smaller distance between node attributes in different clusters (which is detrimental for clustering). On the other hand, we show that graph convolution benefits clustering by considerably reducing the number of edges among different clusters. We then extend our result on AGC to AHC and leverage the hypergraph convolution to propose an unsupervised, fast, and memory-efficient algorithm (GRAC) for AHC, which achieves excellent performance on popular supervised clustering measures.

Skip Supplemental Material Section

Supplemental Material

video-CIKM21-rgfp1475.mp4

mp4

270.2 MB

References

  1. Reid Andersen, Fan Chung, and Kevin Lang. 2007. Using PageRank to Locally Partition a Graph. Internet Mathematics 4 (01 2007), 35--64. https://doi.org/10.1080/15427951.2007.10129139Google ScholarGoogle Scholar
  2. Deng Cai, Xiaofei He, Jiawei Han, and Thomas S Huang. 2010. Graph regularized nonnegative matrix factorization for data representation. IEEE transactions on pattern analysis and machine intelligence 33, 8 (2010), 1548--1560. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T.-H. Hubert Chan and Zhibin Liang. 2020. Generalizing the hypergraph Laplacian via a diffusion process with mediators. Theor. Comput. Sci. 806 (2020), 416--428. https://doi.org/10.1016/j.tcs.2019.07.024Google ScholarGoogle ScholarCross RefCross Ref
  4. T.-H. Hubert Chan, Anand Louis, Zhihao Gavin Tang, and Chenzi Zhang. 2018. Spectral Properties of Hypergraph Laplacian and Approximation Algorithms. J. ACM 65, 3, Article 15 (March 2018), 48 pages. https://doi.org/10.1145/3178123 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Olivier Delalleau, Yoshua Bengio, and Nicolas Le Roux. 2005. Efficient NonParametric Function Induction in Semi-Supervised Learning. In Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. R5), Robert G. Cowell and Zoubin Ghahramani (Eds.). PMLR, 96--103. http://proceedings.mlr.press/r5/delalleau05a.html Reissued by PMLR on 30 March 2021.Google ScholarGoogle Scholar
  6. Inderjit Dhillon, Yuqiang Guan, and Brian Kulis. 2007. Weighted Graph Cuts without Eigenvectors A Multilevel Approach. IEEE transactions on pattern analysis and machine intelligence 29 (12 2007), 1944--57. https://doi.org/10.1109/TPAMI. 2007.1115 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Inderjit S. Dhillon, Yuqiang Guan, and Brian Kulis. 2004. Kernel k-means: spectral clustering and normalized cuts. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, Washington, USA, August 22--25, 2004, Won Kim, Ron Kohavi, Johannes Gehrke, and William DuMouchel (Eds.). ACM, 551--556. https://doi.org/10.1145/1014052.1014118 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Rundong Du, Barry L. Drake, and Haesun Park. 2017. Hybrid Clustering based on Content and Connection Structure using Joint Nonnegative Matrix Factorization. CoRR abs/1703.09646 (2017). arXiv:1703.09646 http://arxiv.org/abs/1703.09646Google ScholarGoogle Scholar
  9. Yifan Feng, Haoxuan You, Zizhao Zhang, Rongrong Ji, and Yue Gao. 2018. Hypergraph Neural Networks. AAAI 2019 (2018).Google ScholarGoogle Scholar
  10. Yifan Feng, Haoxuan You, Zizhao Zhang, Rongrong Ji, and Yue Gao. 2018. Hypergraph Neural Networks. AAAI 2019 (2018).Google ScholarGoogle Scholar
  11. Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Ting Guo, Shirui Pan, Xingquan Zhu, and Chengqi Zhang. 2018. CFOND: consensus factorization for co-clustering networked data. IEEE Transactions on Knowledge and Data Engineering 31, 4 (2018), 706--719. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in neural information processing systems. 1024--1034. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Po-Yao Huang, Robert Frederking, et al. 2019. RWR-GAE: Random Walk Regularization for Graph Auto Encoders. arXiv preprint arXiv:1908.04003 (2019).Google ScholarGoogle Scholar
  15. Jianbo Shi and J. Malik. 2000. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 8 (2000), 888--905. https://doi.org/10.1109/34.868688 Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Jiang Jianwen, Wei Yuxuan, Feng Yifan, Cao Jingxuan, and Gao Yue. 2019. Dynamic Hypergraph Neural Networks. In Proceedings of International Joint Conferences on Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Vassilis Kalofolias. 2016. How to Learn a Graph from Smooth Signals. In Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 51), Arthur Gretton and Christian C. Robert (Eds.). PMLR, Cadiz, Spain, 920--929. http://proceedings.mlr.press/v51/kalofolias16.htmlGoogle ScholarGoogle Scholar
  18. Thomas N Kipf and Max Welling. 2016. Variational Graph Auto-Encoders. NIPS Workshop on Bayesian Deep Learning (2016).Google ScholarGoogle Scholar
  19. Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations (ICLR).Google ScholarGoogle Scholar
  20. Da Kuang, Chris Ding, and Haesun Park. 2012. Symmetric nonnegative matrix factorization for graph clustering. In Proceedings of the 2012 SIAM international conference on data mining. SIAM, 106--117.Google ScholarGoogle ScholarCross RefCross Ref
  21. Da Kuang, Chris Ding, and Haesun Park. 2012. Symmetric nonnegative matrix factorization for graph clustering. In Proceedings of the 2012 SIAM international conference on data mining. SIAM, 106--117.Google ScholarGoogle ScholarCross RefCross Ref
  22. Tarun Kumar, Sankaran Vaidyanathan, Harini Ananthapadmanabhan, Srinivasan Parthasarathy, and Balaraman Ravindran. 2018. Hypergraph Clustering: A Modularity Maximization Approach. CoRR abs/1812.10869 (2018). arXiv:1812.10869 http://arxiv.org/abs/1812.10869Google ScholarGoogle Scholar
  23. Marius Leordeanu and Cristian Sminchisescu. 2012. Efficient Hypergraph Clustering. AISTATS (01 2012).Google ScholarGoogle Scholar
  24. D. Li, Z. Xu, S. Li, and X. Sun. 2013. Link prediction in social networks based on hypergraph. WWW 2013 Companion - Proceedings of the 22nd International Conference on World Wide Web (01 2013), 41--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Pan Li and Olgica Milenkovic. 2017. Inhomogeneous Hypergraph Clustering with Applications. In Advances in Neural Information Processing Systems, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.), Vol. 30. Curran Associates, Inc., 2308--2318. https://proceedings.neurips. cc/paper/2017/file/a50abba8132a77191791390c3eb19fe7-Paper.pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Q. Li, Z. Han, and X.-M. Wu. 2018. Deeper Insights into Graph Convolutional Networks for Semi-Supervised Learning. In The Thirty-Second AAAI Conference on Artificial Intelligence. AAAI.Google ScholarGoogle ScholarCross RefCross Ref
  27. Qimai Li, Xiao-Ming Wu, Han Liu, Xiaotong Zhang, and Zhichao Guan. 2019. Label efficient semi-supervised learning via graph filtering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9582--9591.Google ScholarGoogle ScholarCross RefCross Ref
  28. Ulrike Luxburg. 2004. A Tutorial on Spectral Clustering. Statistics and Computing 17 (01 2004), 395--416. https://doi.org/10.1007/s11222-007--9033-z Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. James MacQueen et al. 1967. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, Vol. 1. Oakland, CA, USA, 281--297.Google ScholarGoogle Scholar
  30. James MacQueen et al. 1967. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, Vol. 1. Oakland, CA, USA, 281--297.Google ScholarGoogle Scholar
  31. Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, Lina Yao, and Chengqi Zhang. 2018. Adversarially Regularized Graph Autoencoder for Graph Embedding.. In IJCAI. 2609--2615. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. P. Perona and W. Freeman. 1998. A Factorization Approach to Grouping. In in European Conference on Computer Vision. 655--670. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. 701--710. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. J.A. Rodriguez. 2002. On the Laplacian Eigenvalues and Metric Parameters of Hypergraphs. Linear and Multilinear Algebra 50, 1 (2002), 1--14. https://doi.org/10.1080/03081080290011692 arXiv:https://doi.org/10.1080/03081080290011692Google ScholarGoogle ScholarCross RefCross Ref
  35. Shota Saito, Danilo P. Mandic, and Hideyuki Suzuki. 2017. Hypergraph p-Laplacian: A Differential Geometry View. CoRR abs/1711.08171 (2017). arXiv:1711.08171 http://arxiv.org/abs/1711.08171Google ScholarGoogle Scholar
  36. D. I. Shuman, S. K. Narang, P. Frossard, A. Ortega, and P. Vandergheynst. 2013. The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Processing Magazine 30, 3 (2013), 83--98. https://doi.org/10.1109/MSP.2012.2235192Google ScholarGoogle ScholarCross RefCross Ref
  37. Yuuki Takai, Atsushi Miyauchi, Masahiro Ikeda, and Yuichi Yoshida. 2020. Hypergraph Clustering Based on PageRank. 1970--1978. https://doi.org/10.1145/3394486.3403248Google ScholarGoogle Scholar
  38. Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. 2018. Graph Attention Networks. International Conference on Learning Representations (2018). https://openreview.net/forum?id= rJXMpikCZ accepted as poster.Google ScholarGoogle Scholar
  39. Chun Wang, Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019. Attributed Graph Clustering: a Deep Attentional Embedding approach. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, Sarit Kraus (Ed.). Association for the Advancement of Artificial Intelligence (AAAI), United States of America, 3670--3676. https://doi.org/10.24963/ijcai.2019/509 Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Chun Wang, Shirui Pan, Ruiqi Hu, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019. Attributed Graph Clustering: a Deep Attentional Embedding approach. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, Sarit Kraus (Ed.). Association for the Advancement of Artificial Intelligence (AAAI), United States of America, 3670--3676. https://doi.org/10.24963/ijcai.2019/509 Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Chun Wang, Shirui Pan, Guodong Long, Xingquan Zhu, and Jing Jiang. 2017. Mgae: Marginalized graph autoencoder for graph clustering. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 889--898. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Joyce Whang, Rundong Du, Sangwon Jung, Geon Lee, Barry Drake, Qingqing Liu, Seonggoo Kang, and Haesun Park. 2020. MEGA: multi-view semi-supervised clustering of hypergraphs. Proceedings of the VLDB Endowment 13 (01 2020), 698--711. https://doi.org/10.14778/3377369.3377378 Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Felix Wu, Amauri Souza, Tianyi Zhang, Christopher Fifty, Tao Yu, and Kilian Weinberger. 2019. Simplifying Graph Convolutional Networks. In Proceedings of the 36th International Conference on Machine Learning. PMLR, 6861--6871.Google ScholarGoogle Scholar
  44. Felix Wu, Amauri Souza, Tianyi Zhang, Christopher Fifty, Tao Yu, and Kilian Weinberger. 2019. Simplifying Graph Convolutional Networks. In Proceedings of the 36th International Conference on Machine Learning. PMLR, 6861--6871.Google ScholarGoogle Scholar
  45. Rongkai Xia, Y. Pan, Lei Du, and J. Yin. 2014. Robust Multi-View Spectral Clustering via Low-Rank and Sparse Decomposition. In AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. 2019. How Powerful are Graph Neural Networks?. In International Conference on Learning Representations. https://openreview.net/forum?id=ryGs6iA5KmGoogle ScholarGoogle Scholar
  47. Zhiqiang Xu, Yiping Ke, Yi Wang, Hong Cheng, and James Cheng. 2012. A model-based approach to attributed graph clustering. Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, SIGMOD '12 (05 2012). https://doi.org/10.1145/2213836.2213894 Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Zhiqiang Xu, Yiping Ke, Yi Wang, Hong Cheng, and James Cheng. 2014. GBAGC: A General Bayesian Framework for Attributed Graph Clustering. ACM Trans. Knowl. Discov. Data 9 (2014), 5:1--5:43. http://dblp.uni-trier.de/db/journals/tkdd/tkdd9.html#XuK0CC14 Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Naganand Yadati, Madhav Nimishakavi, Prateek Yadav, Vikram Nitin, Anand Louis, and Partha Talukdar. 2019. HyperGCN: A New Method For Training Graph Convolutional Networks on Hypergraphs. In Advances in Neural Information Processing Systems (NeurIPS) 32. Curran Associates, Inc., 1509--1520. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Tianbao Yang, Rong Jin, Yun Chi, and Shenghuo Zhu. 2009. Combining Link and Content for Community Detection: A Discriminative Approach. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Paris, France) (KDD '09). Association for Computing Machinery, New York, NY, USA, 927--936. https://doi.org/10.1145/1557019.1557120 Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Xiaotong Zhang, Han Liu, Qimai Li, and Xiao-Ming Wu. 2019. Attributed Graph Clustering via Adaptive Graph Convolution. 4327--4333. https://doi.org/10.24963/ijcai.2019/601 Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Lingxiao Zhao and Leman Akoglu. 2020. PairNorm: Tackling Oversmoothing in GNNs. In International Conference on Learning Representations. https://openreview.net/forum?id=rkecl1rtwBGoogle ScholarGoogle Scholar
  53. Dengyong Zhou, Jiayuan Huang, and Bernhard Schölkopf. 2006. Learning with Hypergraphs: Clustering, Classification, and Embedding. Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference, 1601--1608 (2007) 19, 1601--1608. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. HyperGraph Convolution Based Attributed HyperGraph Clustering

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
      October 2021
      4966 pages
      ISBN:9781450384469
      DOI:10.1145/3459637

      Copyright © 2021 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 30 October 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader