Skip to main content
Log in

Learning content–social influential features for influence analysis

  • Regular Paper
  • Published:
International Journal of Multimedia Information Retrieval Aims and scope Submit manuscript

Abstract

We address how to measure the information propagation probability between users given certain contents. In sharp contrast to existing works that oversimplify the propagation model as predefined distributions, our approach fundamentally attempts to answer why users are influenced (e.g., by content or relations) and whether the corresponding influential features (e.g., hidden factors) can be inferred from the propagation in the entire network. In particular, we propose a novel method to deeply learn the unified feature representations for both user pair and content, where the homogeneous feature similarity can be used to estimate the propagation probability between users with given content. The features are dubbed content–social influential feature since we consider not only the content of the propagation information but also how it propagates over the social network. We design a fast asynchronous parallel algorithm for the feature learning. Through extensive experiments on a real-world social network with 53 million users and 838 million tweets, we show significantly improved performance as compared to other state-of-the-art methods on various social influence analysis tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Notes

  1. By inductively apply the fact \(\sigma (x)+\sigma (-x) = 1\).

  2. Zombies refer to soul-less accounts that post no original content, run by the shady individuals who take customers’ money in exchange for these new “fans”. Bots refer to machine generated user accounts.

  3. We did not find any significant performance variance by using 10 different random split. Therefore, we just arbitrarily chose one split.

References

  1. Bakshy E, Hofman JM, Mason WA, Watts DJ (2011) Everyone’s an influencer: quantifying influence on twitter. In: Proceedings of the 4th ACM international conference on Web search and data mining. ACM, pp 65–74

  2. Barbieri N, Bonchi F, Manco G (2012) Topic-aware social influence propagation models. In: Proceedings of the 12th IEEE international conference on data mining. IEEE, pp 81–90

  3. Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2(1):1–127

    Article  MathSciNet  MATH  Google Scholar 

  4. Bian J, Yang Y, Chua TS (2014) Predicting trending messages and diffusion participants in microblogging network. In: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval. ACM, pp 537–546

  5. Bordes A, Weston J, Collobert R, Bengio Y et al (2011) Learning structured embeddings of knowledge bases. In: Conference on artificial intelligence

  6. Cha M, Haddadi H, Benevenuto F, Gummadi PK (2010) Measuring user influence in twitter: the million follower fallacy. ICWSM 10:10–17

    Google Scholar 

  7. Domingos P, Richardson M (2001) Mining the network value of customers. In: Proceedings of the 7th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 57–66

  8. Dong W, Pentland A (2007) Modeling influence between experts. In: Artifical intelligence for human computing. Springer, Berlin, pp 170–189

  9. Du N, Song L, Yuan M, Smola AJ (2012) Learning networks of heterogeneous influence. In: Advances in neural information processing systems

  10. Freeman LC (2004) The development of social network analysis: a study in the sociology of science, vol 1. Empirical Press, Vancouver

    Google Scholar 

  11. Goldenberg J, Libai B, Muller E (2001) Talk of the network: a complex systems look at the underlying process of word-of-mouth. Mark Lett 12(3):211–223

    Article  Google Scholar 

  12. Gomez Rodriguez M, Leskovec J, Krause A (2010) Inferring networks of diffusion and influence. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining

  13. Goyal A, Bonchi F, Lakshmanan LV (2010) Learning influence probabilities in social networks. In: Proceedings of the 3rd ACM international conference on Web search and data mining. ACM, pp 241–250

  14. Goyal A, Lu W, Lakshmanan LV (2011) Celf++: optimizing the greedy algorithm for influence maximization in social networks. In: Proceedings of the 20th international conference companion on World wide web. ACM, pp 47–48

  15. Haveliwala TH (2002) Topic-sensitive pagerank. In: Proceedings of the 11th international conference on World Wide Web. ACM, pp 517–526

  16. Heidemann J, Klier M, Probst F. Identifying key users in online social networks: a pagerank based approach. In: Proceedings of 31st international conference on information systems (ICIS)

  17. Kempe D, Kleinberg J, Tardos É (2003) Maximizing the spread of influence through a social network. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 137–146

  18. Kwak H, Lee C, Park H, Moon S (2010) What is twitter, a social network or a news media? In: Proceedings of the 19th international conference on World wide web. ACM, pp 591–600

  19. Li N, Gillet D (2013) Identifying influential scholars in academic social media platforms. In: Proceedings of the 2013 IEEE/ACM international conference on advances in social networks analysis and mining. ACM, pp 608–614

  20. Liu B, Cong G, Zeng Y, Xu D, Chee YM (2014) Influence spreading path and its application to the time constrained social influence maximization problem and beyond. IEEE Trans Knowl Data Eng 26(8):1904–1917

    Article  Google Scholar 

  21. Liu L, Tang J, Han J, Jiang M, Yang S (2010) Mining topic-level influence in heterogeneous networks. In: Proceedings of the 19th ACM international conference on Information and knowledge management. ACM, pp 199–208

  22. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119

  23. Morin F, Bengio Y (2005) Hierarchical probabilistic neural network language model. In: AISTATS, vol 5. Citeseer, pp 246–252

  24. Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining

  25. Qian N (1999) On the momentum term in gradient descent learning algorithms. Neural Netw 12(1):145–151

    Article  Google Scholar 

  26. Rashotte L (2007) Social influence. Blackwell Encycl Soc Psychol 9:562–563

    Google Scholar 

  27. Richardson M, Domingos P (2002) Mining knowledge-sharing sites for viral marketing. In: Proceedings of the 8th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 61–70

  28. Rodriguez MG, Balduzzi D, Schölkopf B (2011) Uncovering the temporal dynamics of diffusion networks. arXiv preprint arXiv:1105.0697

  29. Rudat A, Buder J (2015) Making retweeting social: the influence of content and context information on sharing news in twitter. Comput Hum Behav 46:75–84

    Article  Google Scholar 

  30. Tang J, Sun J, Wang C, Yang Z (2009) Social influence analysis in large-scale networks. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 807–816

  31. Tang L, Liu H (2009) Relational learning via latent social dimensions. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 817–826

  32. Wang S, Hu X, Yu PS, Li Z (2014) Mmrate: inferring multi-aspect diffusion networks with multi-pattern cascades. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining

  33. Weng J, Lim EP, Jiang J, He Q (2010) Twitterrank: finding topic-sensitive influential twitterers. In: Proceedings of the 3rd ACM international conference on Web search and data mining. ACM, pp 261–270

  34. Xiao C, Zhang Y, Zeng X, Wu Y (2013) Predicting user influence in social media. J Netw 8(11):2649–2655

    Google Scholar 

  35. Zhang J, Liu B, Tang J, Chen T, Li J (2013) Social influence locality for modeling retweeting behaviors. In: Proceedings of the 23rd international joint conference on artificial intelligence. AAAI Press, pp 2761–2767

  36. Zhang M, Sun C, Liu W. Identifying influential users of micro-blogging services: a dynamic action-based network approach. In: Proceedings of PACIS

  37. Zhou C, Zhang P, Zang W, Guo L (2015) On the upper bounds of spread for greedy algorithms in social network influence maximization. IEEE Trans Knowl Data Eng 1904–1917

Download references

Acknowledgments

This work was partially supported by the NUS-Tsinghua Extreme Search (NExT) project (Grant R-252-300-001-490). NExT research is supported by the National Research Foundation, Prime Minister’s Office, Singapore under its IRC@SG Funding Initiative.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Na Zhao.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, N., Zhang, H., Wang, M. et al. Learning content–social influential features for influence analysis. Int J Multimed Info Retr 5, 137–149 (2016). https://doi.org/10.1007/s13735-016-0102-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13735-016-0102-y

Keywords

Navigation