A Weighted Similarity Measure Based on Meta Structure in Heterogeneous Information Networks

  • Conference paper
Knowledge Management and Acquisition for Intelligent Systems (PKAW 2018)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11016)

Abstract

Evaluating the similarity between two objects in a heterogeneous information network is a significant part of information science. Existing meta-structure-based similarity measures consider only one meta-structure, which leads to a loss of accuracy. Building on the meta-structure, this paper proposes a weighted method to tackle the problem. We put forward a weighting algorithm that assigns a weight to each meta-structure according to a set of user preferences. To compute the similarity value, we convert meta-structures into meta-paths and apply a novel meta-path-based similarity measure, StruSim. A top-k similarity search experiment is conducted to demonstrate the effectiveness of the method. Measured by nDCG, StruSim performs better than PathSim, HeteSim, and AvgSim, and the multiple meta-structure methods perform better than BSCSE and unweighted meta-path-based methods. Finally, we propose an interpolation and derivation method to search for the optimal bias factor in StruSim to achieve better performance.
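
The abstract reports results in terms of nDCG on a top-k similarity search. As a rough illustration of how such a score can be computed, the sketch below uses the common log2-discount form of DCG; this may differ in detail from the variant used in the paper. The function names and the relevance labels are hypothetical, and the ideal ranking is taken from the retrieved list only, which is a simplification.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain of a ranked list of graded relevance labels.

    Assumes the common form: sum of rel_i / log2(i + 1) for rank i = 1, 2, ...
    """
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg_at_k(ranked_relevances, k):
    """nDCG@k: DCG of the top-k results divided by the DCG of the ideal order.

    Simplification: the ideal order is built from the same list of labels,
    not from a separate ground-truth pool.
    """
    ideal = sorted(ranked_relevances, reverse=True)
    ideal_dcg = dcg(ideal[:k])
    if ideal_dcg == 0:
        return 0.0  # no relevant items at all
    return dcg(ranked_relevances[:k]) / ideal_dcg


# Hypothetical relevance labels (2 = highly relevant, 0 = irrelevant)
# for the results returned by two similarity measures.
perfect_order = [2, 1, 1, 0]
reversed_order = [0, 1, 1, 2]

score_perfect = ndcg_at_k(perfect_order, k=4)
score_reversed = ndcg_at_k(reversed_order, k=4)
```

A ranking that already places the most relevant objects first scores 1, and any other ordering of the same labels scores lower, which is what makes the measure suitable for comparing top-k result lists across similarity measures.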

References

  1. Fouss, F., Pirotte, A., Renders, J.M., Saerens, M.: Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation. IEEE Trans. Knowl. Data Eng. 19(3), 355–369 (2007)

  2. Huang, Z., Zheng, Y., Cheng, R., Sun, Y., Mamoulis, N., Li, X.: Meta structure: computing relevance in large heterogeneous information networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1595–1604. ACM (2016)

  3. Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. (TOIS) 20(4), 422–446 (2002)

  4. Lao, N., Cohen, W.W.: Fast query execution for retrieval models based on path-constrained random walks. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 881–888. ACM (2010)

  5. Lao, N., Cohen, W.W.: Relational retrieval using a combination of path-constrained random walks. Mach. Learn. 81(1), 53–67 (2010)

  6. Ley, M.: DBLP computer science bibliography (2005)

  7. Meng, C., Cheng, R., Maniu, S., Senellart, P., Zhang, W.: Discovering meta-paths in large heterogeneous information networks. In: Proceedings of the 24th International Conference on World Wide Web, pp. 754–764. International World Wide Web Conferences Steering Committee (2015)

  8. Meng, X., Shi, C., Li, Y., Zhang, L., Wu, B.: Relevance measure in large-scale heterogeneous networks. In: Chen, L., Jia, Y., Sellis, T., Liu, G. (eds.) APWeb 2014. LNCS, vol. 8709, pp. 636–643. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11116-2_61

  9. Shi, C., Kong, X., Huang, Y., Yu, P.S., Wu, B.: HeteSim: a general framework for relevance measure in heterogeneous networks. IEEE Trans. Knowl. Data Eng. 26(10), 2479–2492 (2014)

  10. Shi, C., Yu, P.S.: Heterogeneous Information Network Analysis and Applications. Data Analytics. Springer, New York (2017). https://doi.org/10.1007/978-3-319-56212-4

  11. Shi, C., Zhang, Z., Luo, P., Yu, P.S., Yue, Y., Wu, B.: Semantic path based personalized recommendation on weighted heterogeneous information networks. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM 2015, pp. 453–462. ACM (2015)

  12. Sun, Y., Han, J.: Meta-path-based search and mining in heterogeneous information networks. Tsinghua Sci. Technol. 18(4), 329–338 (2013)

  13. Sun, Y., Han, J., Yan, X., Yu, P.S., Wu, T.: PathSim: meta path-based top-k similarity search in heterogeneous information networks. PVLDB 4(11), 992–1003 (2011)

  14. Sun, Y., Norick, B., Han, J., Yan, X., Yu, P.S., Yu, X.: Integrating meta-path selection with user-guided object clustering in heterogeneous information networks. In: Yang, Q., Agarwal, D., Pei, J. (eds.) The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012, Beijing, China, 12–16 August 2012, pp. 1348–1356. ACM (2012)

  15. Tang, Z.P., Yang, Y., Bu, Y.: Weighted-pathSim: similarity measure for plot-based movie recommendation

Author information

Corresponding author

Correspondence to Zhaochen Li.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Li, Z., Wang, H. (2018). A Weighted Similarity Measure Based on Meta Structure in Heterogeneous Information Networks. In: Yoshida, K., Lee, M. (eds) Knowledge Management and Acquisition for Intelligent Systems. PKAW 2018. Lecture Notes in Computer Science, vol 11016. Springer, Cham. https://doi.org/10.1007/978-3-319-97289-3_22

  • DOI: https://doi.org/10.1007/978-3-319-97289-3_22

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97288-6

  • Online ISBN: 978-3-319-97289-3

  • eBook Packages: Computer Science (R0)