Skip to main content
Log in

Modeling and predicting cascading removal phenomenon over social networks

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Innovations, opinions, ideas, recommendations or tendencies emerge in a variety of social networks. They can either disappear quickly or propagate and create considerable impact on the network. Their disappearance may also spread from one node to another across the network creating cascading behavior. Cascading phenomenon is mainly analyzed either by identifying the most influential nodes according to their features in the network, detecting quickly the phenomenon or targeting a minimum set of nodes that could maximize the spread of influence or minimize the propagation of a rumor or an outbreak. The objective of the present work is to predict the nodes to be deleted in cascade following the disappearance of one or many nodes. The cascading removal phenomenon is imitated by three well-known influence maximization cascading models in addition to two variants of a new cascading strategy which sound more consistent with human intuition over cascading removals. The prediction is done for an individual iteration of the cascading models, with the ability to be projected over the entire course of cascades without any loss of generality. We compare the prediction accuracy over three real-life networks and five synthetically generated schemas that imitate real social networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. A cut-point is a node whose removal from a graph disconnects it or, more generally, increases the number of components in the structure.

  2. The normalized value of a given feature is computed as follows: (V f – MIN f)/(MAX f MIN f ), where V f is the value of one of the mentioned features.

  3. Parameter values are listed to help the reproducibility of the research.

  4. This attribute can be also justified by increasing the number of training data.

References

  • Allen F, Gale DM (2000) Financial contagion. J Polit Econ 108(1):1–33

    Article  Google Scholar 

  • Anderson RM, May RM (1992) Infectious diseases of humans. Oxford University Press, Oxford

    Google Scholar 

  • Blume L, Easley D, Kleinberg J, Kleinberg R, Tardos É (2011) Which networks are least susceptible to cascading failures? In: Paper presented at the FOCS’11 Proceedings of the 2011 IEEE 52nd annual symposium on foundations of computer science

  • Borgatti SP (2005) Centrality and network flow. Social Netw 27(1):55–71

    Article  Google Scholar 

  • Brandes U (2001) A faster algorithm for betweenness centrality. J Math Sociol 25(2):163–177

    Article  MATH  Google Scholar 

  • Chen W, Wang Y, Yang S (2009) Efficient influence maximization in social networks.In: Paper presented at the KDD’09, pp 199–208

  • Cohen JE (1997) Size-estimation framework with applications to transitive closure and reachability. Comput Syst Sci 55(3):441–453

    Article  MATH  Google Scholar 

  • CORE (2012) Bridges and brokers in social network analysis. Defense analysis department at the Naval Postgraduate School

  • Cui P, Jin S, Yu L, Wang F, Zhu W, Yang S (2013) Cascading outbreak prediction in networks: a data-driven approach. Paper presented at the 19th ACM SIGKDD international conference on knowledge discovery and data mining, pp 901–909

  • Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc Ser B (Methodological). 39(1):1–38

    MathSciNet  MATH  Google Scholar 

  • Domingos P, Richardson M (2002) Mining knowledge-sharing sites for viral marketing. Paper presented at the proceedings of the 8th ACM SIGKDD conference on knowledge discovery and data mining

  • Easley D, Kleinberg J (2010) Networks, crowds and markets: reasoning about a highly connected world. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Everett MG, Borgatti SP (2005) Extending centrality. In: Carrington PJ, Scott J, Wasserman S (eds) Models and methods in social network analysis. Cambridge University Press, Cambridge, pp 57–76

    Chapter  Google Scholar 

  • Everett MG, Borgatti SP (2006) A graph-theoretic perspective on centrality. Soc Netw 28(4):466–484

    Article  Google Scholar 

  • Ferrara E (2012) A large-scale community structure analysis in Facebook. EPJ Data Sci 1(9):1–30

    Google Scholar 

  • Fortunato S (2010) Community detection in graphs. Phys Rep 486(3–5):75–174

    Article  MathSciNet  Google Scholar 

  • Fowler JH, Christakis NA (2009). Cooperative behaviour cascades in human social networks. http://arxiv.org/abs/0908.3497

  • Freeman LC (1977) A set of measures of centrality based on betweenness. Sociometry 40(1):35–41

    Article  Google Scholar 

  • Genkin A, Lewis DD, Madigan D (2012) Large-scale bayesian logistic regression for text categorization. Technometrics 49(3):291–304

    Article  MathSciNet  Google Scholar 

  • Granovetter M (1978) Threshold models of collective behavior. Am J Sociol 83(6):1420–1443

    Article  Google Scholar 

  • Gupta T, Garg S, Mahanti A, Carlsson N, Arlitt M (2009) A web-based social aggregation service. ICWSM

  • Hanneman RA, Riddle M (2005) Introduction to social network methods. http://faculty.ucr.edu/~hanneman/nettext/

  • Ilyas MU, Radha H (2011) Identifying influential nodes in online social networks using principal component centrality. In: Paper presented at the proceedings of IEEE international conference on communications(ICC), Kyoto

  • Kempe D, Kleinberg J, Tardos É (2003) Maximizing the spread of influence through a social network. Paper presented at the proceedings of the 9th ACM SIGKDD conference on knowledge discovery and data mining

  • Kempe D, Kleinberg J, Tardos É (2005) Influential nodes in a diffusion model for social networks. Automata, languages and programming. Springer, Berlin, pp 1127–1138

    Chapter  Google Scholar 

  • Kleinberg J (2007) Cascading behavior in networks: algorithmic and economic issues. In: Cambridge (ed) Algorithmic game theory. Cambridge University Press, New York, pp 613–632

    Chapter  Google Scholar 

  • Krebs V (2011) Social network analysis. A brief introduction. Accessed August 2011http://www.orgnet.com/sna.html

  • Lancichinetti A, Fortunato S (2009) Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Phys Rev E 80(1):016118

    Article  Google Scholar 

  • Leskovec J, Krause A, Guestrin C, Faloutsos C, VanBriesen JM, Glance NS (2007) Cost-effective outbreak detection in networks. Paper presented at the KDD’07 proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, New York

  • Leskovec J, McGlohon M, Faloutsos C, Glance N, Hurst M (2007) Cascading behavior in large blog graphs. Paper presented at the proceedings of 7th SIAM international conference on data mining (SDM)

  • Newman MEJ (2001) The structure of scientific collaboration networks. Proc Natl Acad Sci U.S.A 98(2):404–409

    Article  MATH  Google Scholar 

  • Newman MEJ (2013) Network data. http://www-personal.umich.edu/~mejn/netdata/. Accessed April 19 2013

  • Nguyen NP, Dinh TN, Nguyen DT, Thai MT (2011). Overlapping community structures and their detection on social networks. In: Paper presented at the IEEE 3rd international conference on social computing (socialcom), Boston

  • Openflights (2014) Accessed May 1 2014 http://openflights.org/data.html

  • Zweig KA, Iyengar S (2010) An introductory course on network analysis. Accessed March 13 2013 https://sites.google.com/site/networkanalysisacourse/home

  • Platt JC (1999) Fast training of support vector machines using sequential minimal optimization. Advances in kernel methods—support vector learning. MIT Press, Massachusetts, pp 185–208

    Google Scholar 

  • Sala A, Gaito S, Rossi GP, Zheng H, Zhao BY (2011) Revisiting degree distribution models for social graph analysis. (Electronic version)

  • Schneider JA, Cornwell B, Ostrow D, Michaels S, Schumm P, Laumann EO et al (2013) Network mixing and network influences most linked to HIV infection and risk behavior in the HIV epidemic among black-men who have sex with men. Am J Public Health 103(1):e28–e36

    Article  Google Scholar 

  • Wasserman S, Faust K (1994) Social network analysis: methods and application, 1st edn. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Wicklin R (2011) Modeling finite mixtures with the FMM procedure

  • Xuqing Huang, Irena Vodenska, Havlin S, Stanley H E (2013). Cascading failures in bi-partite graphs: model for systemic risk propagation. Sci Rep 3, Boston University, Boston

  • Zhuge H, Zhang J (2010) Topological centrality and its e-science applications. JASIST 61(9):1824–1841

    Article  Google Scholar 

  • Zio E, Sansavini G (2011) Modeling cascading failures in systems of systems with uncertain behavior. Paper presented at the ICASP11, 11th international conference on applications of statistics and probability in civil engineering, pp 1858–1866

Download references

Acknowledgments

This study has been financially supported by the natural sciences and engineering research council of Canada (NSERC) research grant. The authors would like to thank the reviewers for their comments that help improve the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dyah Anggraini.

Appendix A

Appendix A

List of 100 network features that have been engineered, calculated and used in this paper:

  1. 1.

    Average incoming weights before deletion.

  2. 2.

    Minimum incoming weights before deletion.

  3. 3.

    Maximum incoming weights before deletion.

  4. 4.

    Average outgoing weights before deletion.

  5. 5.

    Minimum outgoing weights before deletion.

  6. 6.

    Maximum outgoing weights before deletion.

  7. 7.

    PageRank of the node before deletion.

  8. 8.

    Transitivity of the node before deletion.

  9. 9.

    Indegree of the node before deletion.

  10. 10.

    Outdegree of the node before deletion.

  11. 11.

    Degree centrality before deletion.

  12. 12.

    Betweenness centrality of the node before deletion.

  13. 13.

    Closeness centrality of the node before deletion.

  14. 14.

    Eigenvalue of the node before deletion.

  15. 15.

    Coreness value of the node before deletion.

  16. 16.

    Hub value of the node before deletion.

  17. 17.

    Authority value of the node before deletion.

  18. 18.

    Eccentricity value of the node before deletion.

  19. 19.

    Number of community membership of a node before deletion.

  20. 20.

    Average size of community membership of a node before deletion.

  21. 21.

    Minimum size of community membership of a node before deletion.

  22. 22.

    Maximum size of community membership of a node before deletion.

  23. 23.

    If the node is a deleted one.

  24. 24.

    Number of neighbors of deleted nodes.

  25. 25.

    Average incoming weights of all neighbors of deleted nodes.

  26. 26.

    Minimum incoming weights of all neighbors of deleted nodes.

  27. 27.

    Maximum incoming weights of all neighbors of deleted nodes.

  28. 28.

    Average outgoing weights of all neighbors of deleted nodes.

  29. 29.

    Minimum outgoing weights of all neighbors of deleted nodes.

  30. 30.

    Maximum outgoing weights of all neighbors of deleted nodes.

  31. 31.

    Average PageRank of neighbors of deleted nodes.

  32. 32.

    Minimum PageRank of neighbors of deleted nodes.

  33. 33.

    Maximum PageRank of neighbors of deleted nodes.

  34. 34.

    Average transitivity of neighbors of deleted nodes.

  35. 35.

    Minimum transitivity of neighbors of deleted nodes.

  36. 36.

    Maximum transitivity of neighbors of deleted nodes.

  37. 37.

    Average degree of neighbors of deleted nodes.

  38. 38.

    Minimum degree of neighbors of deleted nodes.

  39. 39.

    Maximum degree of neighbors of deleted nodes.

  40. 40.

    Average degree centrality of the neighbors of deleted nodes.

  41. 41.

    Minimum degree centrality of the neighbors of deleted nodes.

  42. 42.

    Maximum degree centrality of the neighbors of deleted nodes.

  43. 43.

    Average betweenness centrality of the neighbors of deleted nodes.

  44. 44.

    Minimum betweenness centrality of the neighbors of deleted nodes.

  45. 45.

    Maximum betweenness centrality of the neighbors of deleted nodes.

  46. 46.

    Average closeness centrality of the neighbors of deleted nodes.

  47. 47.

    Minimum closeness centrality of the neighbors of deleted nodes.

  48. 48.

    Maximum closeness centrality of the neighbors of deleted nodes.

  49. 49.

    Average eigenvalues of the neighbors of deleted nodes.

  50. 50.

    Minimum eigenvalues of the neighbors of deleted nodes.

  51. 51.

    Maximum eigenvalues of the neighbors of deleted nodes.

  52. 52.

    Average coreness values of the neighbors of deleted nodes.

  53. 53.

    Minimum coreness values of the neighbors of deleted nodes.

  54. 54.

    Maximum coreness values of the neighbors of deleted nodes.

  55. 55.

    Average hub values of the neighbors of deleted nodes.

  56. 56.

    Minimum hub values of the neighbors of deleted nodes.

  57. 57.

    Maximum hub values of the neighbors of deleted nodes.

  58. 58.

    Average authority values of the neighbors of deleted nodes.

  59. 59.

    Minimum authority values of the neighbors of deleted nodes.

  60. 60.

    Maximum authority values of the neighbors of deleted nodes.

  61. 61.

    Average eccentricity values of the neighbors of deleted nodes.

  62. 62.

    Minimum eccentricity values of the neighbors of deleted nodes.

  63. 63.

    Maximum eccentricity values of the neighbors of deleted nodes.

  64. 64.

    Average number of common neighbors among the neighbors of deleted nodes.

  65. 65.

    Minimum number of common neighbors among the neighbors of deleted nodes.

  66. 66.

    Maximum number of common neighbors among the neighbors of deleted nodes.

  67. 67.

    Average number of common communities with the neighbors of deleted nodes.

  68. 68.

    Minimum number of common communities with the neighbors of deleted nodes.

  69. 69.

    Maximum number of common communities with the neighbors of deleted nodes.

  70. 70.

    Average size of common communities with the neighbors of deleted nodes.

  71. 71.

    Minimum size of common communities with the neighbors of deleted nodes.

  72. 72.

    Maximum size of common communities with the neighbors of deleted nodes.

  73. 73.

    Average incoming similarity with the neighbors of deleted nodes.

  74. 74.

    Minimum incoming similarity with the neighbors of deleted nodes.

  75. 75.

    Maximum incoming similarity with the neighbors of deleted nodes.

  76. 76.

    Average outgoing similarity with the neighbors of deleted nodes.

  77. 77.

    Minimum outgoing similarity with the neighbors of deleted nodes.

  78. 78.

    Maximum outgoing similarity with the neighbors of deleted nodes.

  79. 79.

    Average incoming weights after deletion.

  80. 80.

    Minimum incoming weights after deletion.

  81. 81.

    Maximum incoming weights after deletion.

  82. 82.

    Average outgoing weights after deletion.

  83. 83.

    Minimum outgoing weights after deletion.

  84. 84.

    Maximum outgoing weights after deletion.

  85. 85.

    PageRank of the node after deletion.

  86. 86.

    Transitivity of the node after deletion.

  87. 87.

    Indegree of the node after deletion.

  88. 88.

    Outdegree of the node after deletion.

  89. 89.

    Degree centrality after deletion.

  90. 90.

    Betweenness centrality of the node after deletion.

  91. 91.

    Closeness centrality of the node after deletion.

  92. 92.

    Eigenvalue of the node after deletion.

  93. 93.

    Coreness value of the node after deletion.

  94. 94.

    Hub value of the node after deletion.

  95. 95.

    Authority value of the node after deletion.

  96. 96.

    Eccentricity value of the node after deletion.

  97. 97.

    Number of community membership of a node after deletion.

  98. 98.

    Average size of community membership of a node after deletion.

  99. 99.

    Minimum size of community membership of a node after deletion.

  100. 100.

    Maximum size of community membership of a node after deletion.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Razavi, A.H., Anggraini, D., Missaoui, R. et al. Modeling and predicting cascading removal phenomenon over social networks. Soc. Netw. Anal. Min. 4, 233 (2014). https://doi.org/10.1007/s13278-014-0233-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-014-0233-1

Keywords

Navigation