skip to main content
10.1145/3340531.3411966acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Feature Fusion Based Subgraph Classification for Link Prediction

Published:19 October 2020Publication History

ABSTRACT

Link prediction, which centers on whether or not a pair of nodes is likely to be connected, is a fundamental problem in complex network analysis. Network-embedding-based link prediction has shown strong performance and robustness in previous studies on complex networks, recommendation systems, and knowledge graphs. This approach has certain drawbacks, however; namely, the hierarchical structure of a subgraph is ignored and the importance of different nodes is not distinguished. In this study, we established the Subgraph Hierarchy Feature Fusion (SHFF) model for link prediction. To probe the existence of links between node pairs, the SHFF first extracts a subgraph around the two nodes and learns a function to map the subgraph to a vector for subsequent classification. This reveals any link between the two target nodes. The SHFF learns a function to obtain a representation of the extracted subgraph by hierarchically aggregating the features of nodes in that subgraph, which is accomplished by grouping nodes with similar structures and assigning different importance to the nodes during the feature fusion process. We compared the proposed model against other state-of-the-art link-prediction methods on a wide range of data sets to find that it consistently outperforms them.

Skip Supplemental Material Section

Supplemental Material

3340531.3411966.mp4

mp4

252.8 MB

References

  1. Ackland, R., et al. Mapping the us political blogosphere: Are conservative bloggers more prominent? In BlogTalk Downunder 2005 Conference, Sydney (2005).Google ScholarGoogle Scholar
  2. Adamic, L. A., and Adar, E. Friends and neighbors on the web. Social networks 25, 3 (2003), 211--230.Google ScholarGoogle Scholar
  3. Aslam, J. A., Yilmaz, E., and Pavlu, V. The maximum entropy method for analyzing retrieval measures. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (New York, NY, USA, 2005), SIGIR '05, Association for Computing Machinery, pp. 27--34.Google ScholarGoogle Scholar
  4. Barabási, A.-L., and Albert, R. Emergence of scaling in random networks. science 286, 5439 (1999), 509--512.Google ScholarGoogle Scholar
  5. Barbieri, N., Bonchi, F., and Manco, G. Who to follow and why: link prediction with explanations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2014), pp. 1266--1275.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Brin, S., and Page, L. The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN System 30, 1--7 (1998), 107--117.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cao, Z., Qin, T., Liu, T.-Y., Tsai, M.-F., and Li, H. Learning to rank: from pairwise approach to listwise approach. In Proceedings of the 24th International Conference on Machine Learning (2007), pp. 129--136.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Chen, H., Yin, H., Wang, W., Wang, H., Nguyen, Q. V. H., and Li, X. Pme: projected metric embedding on heterogeneous networks for link prediction. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2018), pp. 1177--1186.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Chowdhury, G. G. Introduction to modern information retrieval. Facet publishing, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Dai, H., Dai, B., and Song, L. Discriminative embeddings of latent variable models for structured data. In Proceedings of the 33rd International Conference on Machine Learning (2016), pp. 2702--2711.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Duvenaud, D. K., Maclaurin, D., Iparraguirre, J., Bombarell, R., Hirzel, T., Aspuru-Guzik, A., and Adams, R. P. Convolutional networks on graphs for learning molecular fingerprints. In Proceedings of the Advances in Neural Information Processing Systems (2015), pp. 2224--2232.Google ScholarGoogle Scholar
  12. Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O., and Dahl, G. E. Neural message passing for quantum chemistry. In Proceedings of the 34th International Conference on Machine Learning-Volume 70 (2017), JMLR, pp. 1263--1272.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Grover, A., and Leskovec, J. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016), pp. 855--864.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Jeh, G., and Widom, J. Simrank: a measure of structural-context similarity. In Proceedings of the eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2002), pp. 538--543.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Katz, L. A new status index derived from sociometric analysis. Psychometrika 18, 1 (1953), 39--43.Google ScholarGoogle ScholarCross RefCross Ref
  16. Kossinets, G. Effects of missing data in social networks. Social networks 28, 3 (2006), 247--268.Google ScholarGoogle Scholar
  17. Lee, J., Lee, I., and Kang, J. Self-attention graph pooling. arXiv preprint arXiv:1904.08082 (2019).Google ScholarGoogle Scholar
  18. Liben-Nowell, D., and Kleinberg, J. The link-prediction problem for social networks. Journal of the American society for information science and technology 58, 7 (2007), 1019--1031.Google ScholarGoogle Scholar
  19. Lichtenwalter, R. N., Lussier, J. T., and Chawla, N. V. New perspectives and methods in link prediction. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2010), pp. 243--252.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Ma, Y., Wang, S., Aggarwal, C. C., and Tang, J. Graph convolutional networks with eigenpooling. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2019), pp. 723--731.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Negi, S., and Chaudhury, S. Link prediction in heterogeneous social networks. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (2016), pp. 609--617.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Newman, M. Networks: An Introduction. Oxford University Press, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  23. Newman, M. E. Clustering and preferential attachment in growing networks. Physical review E 64, 2 (2001), 025102.Google ScholarGoogle Scholar
  24. Newman, M. E. Finding community structure in networks using the eigenvectors of matrices. Physical review E 74, 3 (2006), 036104.Google ScholarGoogle Scholar
  25. Pal, S. K., and Mitra, S. Multilayer perceptron, fuzzy sets, and classification. IEEE Transactions on Neural Networks 3, 5 (1992), 683--697.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Perozzi, B., Al-Rfou, R., and Skiena, S. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2014), pp. 701--710.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Provost, F., and Fawcett, T. Analysis and visualization of classifier performance: Comparison under imprecise class and cost distributions. In Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining (1997), pp. 43--48.Google ScholarGoogle Scholar
  28. Ribeiro, L. F., Saverese, P. H., and Figueiredo, D. R. struc2vec: Learning node representations from structural identity. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2017), pp. 385--394.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Rossi, R. A., and Ahmed, N. K. The network data repository with interactive graph analytics and visualization. In Proceedings of the 29th AAAI Conference on Artificial Intelligence, pp. 4292--4293.Google ScholarGoogle Scholar
  30. Spring, N., Mahajan, R., and Wetherall, D. Measuring isp topologies with rocketfuel. ACM SIGCOMM Computer Communication Review 32, 4 (2002), 133--145.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., and Mei, Q. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web (2015), pp. 1067--1077.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Velivc ković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., and Bengio, Y. Graph attention networks. arXiv preprint arXiv:1710.10903 (2017).Google ScholarGoogle Scholar
  33. Von Mering, C., Krause, R., Snel, B., Cornell, M., Oliver, S. G., Fields, S., and Bork, P. Comparative assessment of large-scale data sets of protein--protein interactions. Nature 417, 6887 (2002), 399--403.Google ScholarGoogle ScholarCross RefCross Ref
  34. Watts, D. J., and Strogatz, S. H. Collective dynamics of 'smallworld'networks. nature, 393 (6684): 440--442, 1998. Adaptive degree penalization for link prediction (2018).Google ScholarGoogle Scholar
  35. Wu, J., He, J., and Xu, J. Net: Degree-specific graph neural networks for node and graph classification. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2019), pp. 406--415.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Yu, H., Braun, P., Yildirim, M. A., Lemmens, I., Venkatesan, K., Sahalie, J., Hirozane-Kishikawa, T., Gebreab, F., Li, N., Simonis, N., et al. High-quality binary protein interaction map of the yeast interactome network. Science 322, 5898 (2008), 104--110.Google ScholarGoogle ScholarCross RefCross Ref
  37. Zhang, J., Yu, P. S., and Zhou, Z.-H. Meta-path based multi-network collective link prediction. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2014), pp. 1286--1295.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Zhang, M., and Chen, Y. Weisfeiler-lehman neural machine for link prediction. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2017), pp. 575--583.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Zhang, M., and Chen, Y. Link prediction based on graph neural networks. In Advances in Neural Information Processing Systems (2018), pp. 5165--5175.Google ScholarGoogle Scholar
  40. Zhang, M., Cui, Z., Neumann, M., and Chen, Y. An end-to-end deep learning architecture for graph classification. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (2018), pp. 4438--4445.Google ScholarGoogle ScholarCross RefCross Ref
  41. Zhao, Z., Gao, B., Zheng, V. W., Cai, D., He, X., and Zhuang, Y. Link prediction via ranking metric dual-level attention network learning. In IJCAI (2017), pp. 3525--3531.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Zhou, F., Wu, B., Yang, Y., Trajcevski, G., Zhang, K., and Zhong, T. Vec2link: Unifying heterogeneous data for social link prediction. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (2018), pp. 1843--1846.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Feature Fusion Based Subgraph Classification for Link Prediction

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management
          October 2020
          3619 pages
          ISBN:9781450368599
          DOI:10.1145/3340531

          Copyright © 2020 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 19 October 2020

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader