Skip to main content
Log in

Semantic-aware heterogeneous information network embedding with incompatible meta-paths

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

Heterogeneous information network (HIN) embedding represents heterogeneous nodes as vectors in the low-dimensional space. Meta-path is used to measure the nodes similarity to guide HIN embedding. Existing works assume that different meta-paths share the same semantic space and directly fuse the different mate-paths for node similarity calculation. This ignores the incompatibility of different meta-paths, which cannot reflect the real relationship between nodes. To solve the problems of existing works, a novel S emantic-A ware H IN E mbedding (SAHE) is proposed to fuse incompatible meta-paths for node similarity measurement. The key idea of the proposed method is to measure the relative similarity relationship on each meta-path in its own semantic space, and aggregate these similarity relationships to obtain the node similarity to calculate HIN embedding. The kendall tau distance is used to aggregate the different similarity relationship in multiple semantic spaces. The semantic preference is extracted as a constraint to optimize the aggregated similarity matrix. The Kullback-Leibler Divergence (KL Divergence) is used to learn nodes embedding by measuring the node similarity distribution in the embedded space. Experiments on three real HIN datasets verify that the superiority of the proposed model is superior to other state-of-the-art methods on the node classification and the node clustering tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5

Similar content being viewed by others

References

  1. Amiri, M. M., Gündüz, D.: Machine learning at the wireless edge: Distributed stochastic gradient descent over-the-air. IEEE Trans. Signal Process. 68, 2155–2169 (2020). https://doi.org/10.1109/TSP.2020.2981904

    Article  MathSciNet  Google Scholar 

  2. Cen, Y., Zou, X., Zhang, J., Yang, H., Zhou, J., Tang, J.: Representation learning for attributed multiplex heterogeneous network. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1358–1368. https://doi.org/10.1145/3292500.3330964 (2019)

  3. Chen, H., Yin, H., Chen, T., Wang, W., Li, X., Hu, X.: Social boosted recommendation with folded bipartite network embedding. IEEE Transactions on Knowledge and Data Engineering. https://doi.org/10.1109/TKDE.2020.2982878 (2020)

  4. Chen, X., Yu, G., Wang, J., Domeniconi, C., Li, Z., Zhang, X.: Activehne: Active heterogeneous network embedding. arXiv:1905.05659 (2019)

  5. Do, P., Pham, P.: Dw-pathsim: a distributed computing model for topic-driven weighted meta-path-based similarity measure in a large-scale content-based heterogeneous information network. Journal of Information and Telecommunication 3(1), 19–38 (2019). https://doi.org/10.1080/24751839.2018.1516714

    Article  Google Scholar 

  6. Dong, Y., Chawla, N. V., Swami, A.: metapath2vec: Scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 135–144. https://doi.org/10.1145/3097983.3098036 (2017)

  7. Fan, S., Zhu, J., Han, X., Shi, C., Hu, L., Ma, B., Li, Y.: Metapath-guided heterogeneous graph neural network for intent recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2478–2486. https://doi.org/10.1145/3292500.3330673 (2019)

  8. Fu, T.y., Lee, W.C., Lei, Z.: Hin2vec: Explore meta-paths in heterogeneous information networks for representation learning. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 1797–1806. https://doi.org/10.1145/3132847.3132953 (2017)

  9. Fu, Y., Xiong, Y., Philip, S. Y., Tao, T., Zhu, Y.: Metapath enhanced graph attention encoder for hins representation learning. In: 2019 IEEE International Conference on Big Data (Big Data). https://doi.org/10.1109/BigData47090.2019.9006097, pp 1103–1110, IEEE (2019)

  10. Grover, A., Leskovec, J.: node2vec: Scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 855–864. https://doi.org/10.1145/2939672.2939754 (2016)

  11. Hussein, R., Yang, D., Cudré-Mauroux, P.: Are meta-paths necessary? revisiting heterogeneous graph embeddings. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pp. 437–446. https://doi.org/10.1145/3269206.3271777 (2018)

  12. Jiang, Z., Gao, Z., Lan, J., Yang, H., Lu, Y., Liu, X.: Task-oriented genetic activation for large-scale complex heterogeneous graph embedding. In: Proceedings of The Web Conference 2020, pp. 1581–1591. https://doi.org/10.1145/3366423.3380230 (2020)

  13. Kipf, T. N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv:1609.02907 (2016)

  14. Kondratev, A. Y., Nesterov, A. S.: Measuring majority power and veto power of voting rules. Public Choice 183(1), 187–210 (2020). https://doi.org/10.1007/s11127-019-00697-1

    Article  Google Scholar 

  15. Molaei, S., Zare, H., Veisi, H.: Deep learning approach on information diffusion in heterogeneous networks. Knowl.-Based Syst. 189, 105153 (2020). https://doi.org/10.1016/j.knosys.2019.105153

    Article  Google Scholar 

  16. Park, C., Kim, D., Han, J., Yu, H.: Unsupervised attributed multiplex network embedding. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp 5371–5378 (2020)

  17. Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 701–710. https://doi.org/10.1145/2623330.2623732 (2014)

  18. Pham, P., Do, P.: W-metapath2vec: The topic-driven meta-path-based model for large-scaled content-based heterogeneous information network representation learning. Expert Syst. Appl. 123, 328–344 (2019). https://doi.org/10.1016/j.eswa.2019.01.015

    Article  Google Scholar 

  19. Pradhan, T., Pal, S.: Cnaver: A content and network-based academic venue recommender system. Knowl.-Based Syst. 189, 105092 (2020). https://doi.org/10.1016/j.knosys.2019.105092

    Article  Google Scholar 

  20. Rahmani, H. A., Aliannejadi, M., Mirzaei Zadeh, R., Baratchi, M., Afsharchi, M., Crestani, F.: Category-aware location embedding for point-of-interest recommendation. In: Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval. https://doi.org/10.1145/3341981.3344240, pp 173–176 (2019)

  21. Rioux, G., Scarvelis, C., Choksi, R., Hoheisel, T., Marechal, P.: Blind deblurring of barcodes via kullback-leibler divergence. IEEE Trans. Pattern Anal. Mach. Intell. https://doi.org/10.1109/TPAMI.2019.2927311 (2019)

  22. Shang, J., Qu, M., Liu, J., Kaplan, L. M., Han, J., Peng, J.: Meta-path guided embedding for similarity search in large-scale heterogeneous information networks. arXiv:1610.09769 (2016)

  23. Shi, C., Hu, B., Zhao, W. X., Philip, S. Y.: Heterogeneous information network embedding for recommendation. IEEE Trans. Knowl. Data Eng. 31(2), 357–370 (2018)

    Article  Google Scholar 

  24. Shi, Y., Gui, H., Zhu, Q., Kaplan, L., Han, J.: Aspem: Embedding learning by aspects in heterogeneous information networks. In: Proceedings of the 2018 SIAM International Conference on Data Mining. https://doi.org/10.1137/1.9781611975321.16, pp 144–152. SIAM (2018)

  25. Shi, Y., Zhu, Q., Guo, F., Zhang, C., Han, J.: Easing embedding learning by comprehensive transcription of heterogeneous information networks. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. https://doi.org/10.1145/3219819.3220006, pp 2190–2199 (2018)

  26. Song, Y., Deng, Y.: Divergence measure of belief function and its application in data fusion. IEEE Access 7, 107465–107472 (2019)

    Article  Google Scholar 

  27. Sun, Y., Han, J., Yan, X., Yu, P. S., Wu, T.: Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. Proceedings of the VLDB Endowment 4(11), 992–1003 (2011)

    Article  Google Scholar 

  28. Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: Large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web. https://doi.org/10.1145/2736277.2741093, pp 1067–1077 (2015)

  29. Tu, K., Ma, J., Cui, P., Pei, J., Zhu, W.: Autone: Hyperparameter optimization for massive network embedding. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. https://doi.org/10.1145/3292500.3330848, pp 216–225 (2019)

  30. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv:1710.10903 (2017)

  31. Vieira, C., Ribeiro, F., Vaz de Melo, P. O., Benevenuto, F., Zagheni, E.: Using facebook data to measure cultural distance between countries: The case of brazilian cuisine. In: Proceedings of The Web Conference 2020. https://doi.org/10.1145/3366423.3380082, pp 3091–3097 (2020)

  32. Wang, X., Ji, H., Shi, C., Wang, B., Ye, Y., Cui, P., Yu, P. S.: Heterogeneous graph attention network. In: The World Wide Web Conference. https://doi.org/10.1145/3308558.3313562, pp 2022–2032 (2019)

  33. Wang, X., Lu, Y., Shi, C., Wang, R., Cui, P., Mou, S.: Dynamic heterogeneous information network embedding with meta-path based proximity. IEEE Transactions on Knowledge and Data Engineering (2020)

  34. Xu, L., Wei, X., Cao, J., Yu, P. S.: Embedding of embedding (eoe) joint embedding for coupled heterogeneous networks. In: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. https://doi.org/10.1145/3018661.3018723, pp 741–749 (2017)

  35. Yang, D., Wang, S., Li, C., Zhang, X., Li, Z.: From properties to links: Deep network embedding on incomplete graphs. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp 367–376 (2017)

  36. Yang, L., Xiao, Z., Jiang, W., Wei, Y., Hu, Y., Wang, H.: Dynamic heterogeneous graph embedding using hierarchical attentions. In: European Conference on Information Retrieval, pp 425–432. Springer (2020)

  37. Yu, Y., Wang, Z., Yuan, B.: An input-aware factorization machine for sparse prediction. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp 1466–1472. AAAI Press (2019)

  38. Zhang, C., Wang, G., Yu, B., Xie, Y., Pan, K.: Proximity-aware heterogeneous information network embedding. Knowledge-Based Systems, 105468. https://doi.org/10.1016/j.knosys.2019.105468 (2020)

Download references

Acknowledgements

This research is supported by National Natural Science Foundation of China (No. 61672284), Natural Science Foundation of Jiangsu Province (No. BK20171418), Postdoctoral Science Foundation of China (No. 2016M591841) and Jiangsu Planned Projects for Postdoctoral Research Funds (No. 1601225C).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Weiwei Yuan.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zheng, S., Guan, D. & Yuan, W. Semantic-aware heterogeneous information network embedding with incompatible meta-paths. World Wide Web 25, 1–21 (2022). https://doi.org/10.1007/s11280-021-00903-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-021-00903-5

Keywords

Navigation