Skip to main content
Log in

Learning to Predict Links by Integrating Structure and Interaction Information in Microblogs

  • Regular Paper
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

Link prediction in microblogs by using unsupervised methods has been studied extensively in recent years, which aims to find an appropriate similarity measure between users in the network. However, the measures used by existing work lack a simple way to incorporate the structure of the network and the interactions between users. This leads to the gap between the predictive result and the ground truth value. For example, the F1-measure created by the best method is around 0.2. In this work, we firstly discover the gap and prove its existence. To narrow this gap, we define the retweeting similarity to measure the interactions between users in Twitter, and propose a structural-interaction based matrix factorization model for following-link prediction. Experiments based on the real-world Twitter data show that our model outperforms state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Wasserman S, Faust K. Social Network Analysis: Methods and Applications. Cambridge University Press, Nov. 1994.

  2. Gu Q, Zhou J, Ding C. Collaborative filtering: Weighted nonnegative matrix factorization incorporating user and item graphs. In Proc. SDM, April 29-May 1, 2010, pp. 199–210.

  3. Backstrom L, Leskovec J. Supervised random walks: Predicting and recommending links in social networks. In Proc. the 4th WSDM, Feb. 2011, pp. 635–644.

  4. Leskovec J, Huttenlocher D, Kleinberg J. Predicting positive and negative links in online social networks. In Proc. the 19th WWW, Apr. 2010, pp. 641–650.

  5. Adamic L, Adar E. Friends and neighbors on the web. Social Networks, 2003, 25(3): 211–230.

  6. Newman M E. Clustering and preferential attachment in growing networks. Phys. Rev. E, 2001, 64(2): Article No. 025102.

  7. Katz L. A new status index derived from sociometric analysis. Psychometrika, 1953, 18(1): 39–43.

  8. Sadilek A, Kautz H, Bigham J P. Finding your friends and following them to where you are. In Proc. the 5th WSDM, Feb. 2012, pp. 723–732.

  9. Hopcroft J, Lou T, Tang J. Who will follow you back? Reciprocity relationship prediction. In Proc. the 20th CIKM, Oct. 2011, pp. 1137–1146.

  10. Lou T, Tang J, Hopcroft J, Fang Z, Ding X. Learning to predict reciprocity and triadic closure in social networks. ACM Transactions on Knowledge Discovery from Data, 2013, 7(2): 5:1–5:25.

  11. Yin D, Hong L, Davison B D. Structural link analysis and prediction in microblogs. In Proc. the 20th CIKM, Oct. 2011, pp. 1163–1168.

  12. Lichtenwalter R, Lussier J, Chawla N. New perspectives and methods in link prediction. In Proc. the 16th KDD, Jul. 2010, pp. 243–252.

  13. Akasaka R, Grafe P, Kondo M. ‘Me Too’ 2.0: An analysis of viral retweets on the Twitter-sphere, 2010. http://snap.stanford.edu/class/cs224w-2010/proj2010/13_projectFinal.pdf, May 2015.

  14. Kwak H, Lee C, Park H, Moon S. What is Twitter, a social network or a news media? In Proc. the 19th WWW, Apr. 2010, pp. 591–600.

  15. Koren Y. Factorization meets the neighborhood: A multifaceted collaborative filtering model. In Proc. the 14th KDD, Aug. 2008, pp. 426–434.

  16. Koren Y, Bell R, Volinsky C. Matrix factorization techniques for recommender system. Computer, 2009, 42(8): 30–37.

  17. Zhou T, Shan H, Banerjee A, Sapiroy G. Kernelized probabilistic matrix factorization: Exploiting graphs and side information. In Proc. the 12th SDM, Aug. 2012, pp. 403–414.

  18. Menon A K, Elkan C. Link prediction via matrix factorization. In Proc. ECML PKDD, Sept. 2011, pp. 437–452.

  19. Zhang J, Wang C, Wang J, Yu P S. LaFT-tree: Perceiving the expansion trace of one’s circle of friends in online social networks. In Proc. WSDM, Feb. 2013, pp. 597–606.

  20. Gao S, Denoyer L, Gallinari P. Temporal link prediction by integrating content and structure information. In Proc. the 20th CIKM, Oct. 2011, pp. 1169–1174.

  21. Acar E, Dunlavy D, Kolda T. Link prediction on evolving data using matrix and tensor factorizations. In Proc. ICDM Workshops, Dec. 2009, pp. 262–269.

  22. Spiegel S, Clausen J, Albayrak S, Kunegis J. Link prediction on evolving data using tensor factorization. In Proc. PAKDD Workshops, May 2011, pp. 100–110.

  23. Romero D M, Kleinberg J. The directed closure process in hybrid social-information networks, with an analysis of link formation on Twitter. In Proc. the 4th ICWSM, May 2010.

  24. Salakhutdinov R, Mnih A. Probabilistic matrix factorization. In Proc. the 21st NIPS, Dec. 2007, pp. 1257–1264.

  25. Lee D, Seung H S. Algorithms for non-negative matrix factorization. In Proc. NIPS, Nov. 2000, pp. 556–562.

  26. Cialdini R B. Influence: Science and Practice. Allyn and Bacon/Pearson, 2001.

  27. Qu Q, Liu S, Jensen C S, Zhu F, Faloutsos C. Interestingness-driven diffusion process summarization in dynamic networks. In Proc. ECML PKDD, Part II, Sept. 2014, pp. 597–613.

  28. Yin Z, Gupta M, Weninger T, Han J. LINKREC: A unified framework for link recommendation with user attributes and graph structure. In Proc. the 19th WWW, Apr. 2010, pp. 1211–1212.

  29. Liben-Nowell D, Kleinberg J. The link prediction problem for social networks. In Proc. CIKM, Nov. 2003, pp. 556–559.

  30. Liu S, Wang S, Zhu F, Zhang J, Krishnan R. HYDRA: Large-scale social identity linkage via heterogeneous behavior modeling. In Proc. SIGMOD, June 2014, pp. 51–62.

  31. Jia Y, Wang Y, Li J, Feng K, Cheng X, Li J. Structural-interaction link prediction in Microblogs. In Proc. the 22nd WWW, May 2013, pp. 193–194.

  32. Liu D, Wang Y, Jia Y, Li J, Yu Z. From strangers to neighbors: Link prediction in microblogs using social distance game. In Proc. WSDM, Feb. 2014.

  33. Liu D, Wang Y, Jia Y et al. LSDH: A hashing approach for large-scale link prediction in microblogs. In Proc. the 28th AAAI, July 2014, pp. 3120–3121.

  34. Zhao Z, Jia Y, Wang Y, Cheng X. Content-structural relation inference in knowledge base. In Proc. the 28th AAAI, July 2014, pp. 3154–3155.

  35. Jia Y, Wang Y, Cheng X, Jin X, Guo J. OpenKN: An open knowledge computational engine for network big data. In Proc. ASONAM, Aug. 2014, pp. 657–664.

  36. Jia Y, Wang Y, Jin X, Cheng X. TSBM: The temporal-spatial Bayesian model for location prediction in social networks. In Proc. WI-IAT, Aug. 2014, pp. 194–201.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yan-Tao Jia.

Additional information

This work is supported by the National Basic Research 973 Program of China under Grant Nos. 2013CB329602 and 2014CB340405, the National Natural Science Foundation of China under Grant Nos. 61173008, 61232010, 60933005, 61402442, 61402022, and 61303244, Beijing Nova Program under Grant No. Z121101002512063, and the Natural Science Foundation of Beijing under Grant No. 4154086.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jia, YT., Wang, YZ. & Cheng, XQ. Learning to Predict Links by Integrating Structure and Interaction Information in Microblogs. J. Comput. Sci. Technol. 30, 829–842 (2015). https://doi.org/10.1007/s11390-015-1563-9

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11390-015-1563-9

Keywords

Navigation