Modeling and predicting cascading removal phenomenon over social networks

Razavi, Amir H.; Anggraini, Dyah; Missaoui, Rokia; Vaillancourt, Jean; Talbi, Mohamed

doi:10.1007/s13278-014-0233-1

Original Article
Published: 28 October 2014

Volume 4, article number 233 (2014)

Social Network Analysis and Mining

Abstract

Innovations, opinions, ideas, recommendations, and tendencies emerge in a variety of social networks. They can either disappear quickly or propagate and have a considerable impact on the network. Their disappearance may also spread from one node to another across the network, creating cascading behavior. Cascading phenomena are mainly analyzed in one of three ways: by identifying the most influential nodes according to their features in the network, by detecting the phenomenon quickly, or by targeting a minimum set of nodes that could maximize the spread of influence or minimize the propagation of a rumor or an outbreak. The objective of the present work is to predict the nodes that will be deleted in cascade following the disappearance of one or more nodes. The cascading removal phenomenon is simulated by three well-known influence maximization cascading models, in addition to two variants of a new cascading strategy that appears more consistent with human intuition about cascading removals. The prediction is made for an individual iteration of the cascading models and can be projected over the entire course of a cascade without loss of generality. We compare the prediction accuracy over three real-life networks and five synthetically generated schemas that imitate real social networks.
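
To make the notion of "an individual iteration" of a cascading removal concrete, the following is a minimal sketch under a threshold-style rule. It is an illustration only and should not be read as the authors' method: the abstract does not specify the models, so the removal rule (a node is removed once the share of its incoming weight that came from removed nodes reaches a threshold), the function name `cascade_step`, the default threshold, and the toy graph are all assumptions.

```python
from typing import Dict, Set, Tuple

Edge = Tuple[str, str]  # (source, target) of a directed, weighted edge


def cascade_step(
    weights: Dict[Edge, float],
    removed: Set[str],
    threshold: float = 0.5,  # hypothetical value, not taken from the paper
) -> Set[str]:
    """Return the nodes removed in the next iteration of the cascade.

    Assumed rule: a surviving node is removed when the fraction of its
    total incoming weight that originated from already removed nodes is
    at least `threshold`.
    """
    incoming_total: Dict[str, float] = {}
    incoming_lost: Dict[str, float] = {}
    for (src, dst), w in weights.items():
        incoming_total[dst] = incoming_total.get(dst, 0.0) + w
        if src in removed:
            incoming_lost[dst] = incoming_lost.get(dst, 0.0) + w

    newly_removed: Set[str] = set()
    for node, lost in incoming_lost.items():
        total = incoming_total[node]
        if node not in removed and total > 0 and lost / total >= threshold:
            newly_removed.add(node)
    return newly_removed


# Toy graph (assumed data): node "a" disappears first.
weights = {
    ("a", "b"): 1.0,
    ("c", "b"): 1.0,
    ("a", "c"): 3.0,
    ("d", "c"): 1.0,
    ("b", "d"): 1.0,
}
first = cascade_step(weights, {"a"})
second = cascade_step(weights, {"a"} | first)
```

Calling `cascade_step` repeatedly, each time adding the newly removed nodes to `removed`, projects the single iteration over the whole course of a cascade; the loop ends when an iteration returns an empty set.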

Notes

A cut-point is a node whose removal from a graph disconnects it or, more generally, increases the number of components in the structure.
The normalized value of a given feature is computed as follows: (V_f − MIN_f) / (MAX_f − MIN_f), where V_f is the value of one of the mentioned features.
Parameter values are listed to support the reproducibility of the research.
This attribute can also be justified by increasing the amount of training data.
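
The first two notes can be illustrated with a short sketch. It assumes an undirected graph stored as an adjacency dictionary and takes MIN_f and MAX_f to be the minimum and maximum of the feature over the values supplied; the function names, the brute-force component count, and the choice to map a constant feature to 0 are assumptions made for illustration, not details from the paper.

```python
from typing import Dict, Hashable, List, Optional, Set

Graph = Dict[Hashable, Set[Hashable]]  # undirected adjacency sets


def min_max_normalize(values: List[float]) -> List[float]:
    """Apply (V_f - MIN_f) / (MAX_f - MIN_f) to every value of one feature."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Assumption: a constant feature carries no information, so map it to 0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def count_components(adj: Graph, skip: Optional[Hashable] = None) -> int:
    """Count connected components, optionally ignoring the node `skip`."""
    seen: Set[Hashable] = set()
    if skip is not None:
        seen.add(skip)
    components = 0
    for start in adj:
        if start in seen:
            continue
        components += 1
        seen.add(start)
        stack = [start]
        while stack:  # iterative depth-first traversal
            node = stack.pop()
            for neighbor in adj[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
    return components


def cut_points(adj: Graph) -> Set[Hashable]:
    """Nodes whose removal increases the number of components."""
    baseline = count_components(adj)
    return {v for v in adj if count_components(adj, skip=v) > baseline}


# Toy graph (assumed data): a path a-b-c attached to a triangle c-d-e.
adj: Graph = {
    "a": {"b"},
    "b": {"a", "c"},
    "c": {"b", "d", "e"},
    "d": {"c", "e"},
    "e": {"c", "d"},
}
points = cut_points(adj)
scaled = min_max_normalize([2.0, 4.0, 10.0])
```

The brute-force check follows the definition in the note directly and costs one traversal per node; a linear-time depth-first algorithm for articulation points would be preferable on large networks.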

Acknowledgments

This study was financially supported by a research grant from the Natural Sciences and Engineering Research Council of Canada (NSERC). The authors would like to thank the reviewers for their comments, which helped improve the manuscript.

Author information

Authors and Affiliations

Department of Computer Science, Université du Québec en Outaouais (UQO), Gatineau, QC, Canada
Amir H. Razavi, Dyah Anggraini, Rokia Missaoui, Jean Vaillancourt & Mohamed Talbi

Corresponding author

Correspondence to Dyah Anggraini.

Appendix A

List of 100 network features that have been engineered, calculated and used in this paper:

1. Average incoming weights before deletion.
2. Minimum incoming weights before deletion.
3. Maximum incoming weights before deletion.
4. Average outgoing weights before deletion.
5. Minimum outgoing weights before deletion.
6. Maximum outgoing weights before deletion.
7. PageRank of the node before deletion.
8. Transitivity of the node before deletion.
9. Indegree of the node before deletion.
10. Outdegree of the node before deletion.
11. Degree centrality before deletion.
12. Betweenness centrality of the node before deletion.
13. Closeness centrality of the node before deletion.
14. Eigenvalue of the node before deletion.
15. Coreness value of the node before deletion.
16. Hub value of the node before deletion.
17. Authority value of the node before deletion.
18. Eccentricity value of the node before deletion.
19. Number of community memberships of the node before deletion.
20. Average size of community membership of a node before deletion.
21. Minimum size of community membership of a node before deletion.
22. Maximum size of community membership of a node before deletion.
23. Whether the node is a deleted one.
24. Number of neighbors of deleted nodes.
25. Average incoming weights of all neighbors of deleted nodes.
26. Minimum incoming weights of all neighbors of deleted nodes.
27. Maximum incoming weights of all neighbors of deleted nodes.
28. Average outgoing weights of all neighbors of deleted nodes.
29. Minimum outgoing weights of all neighbors of deleted nodes.
30. Maximum outgoing weights of all neighbors of deleted nodes.
31. Average PageRank of neighbors of deleted nodes.
32. Minimum PageRank of neighbors of deleted nodes.
33. Maximum PageRank of neighbors of deleted nodes.
34. Average transitivity of neighbors of deleted nodes.
35. Minimum transitivity of neighbors of deleted nodes.
36. Maximum transitivity of neighbors of deleted nodes.
37. Average degree of neighbors of deleted nodes.
38. Minimum degree of neighbors of deleted nodes.
39. Maximum degree of neighbors of deleted nodes.
40. Average degree centrality of the neighbors of deleted nodes.
41. Minimum degree centrality of the neighbors of deleted nodes.
42. Maximum degree centrality of the neighbors of deleted nodes.
43. Average betweenness centrality of the neighbors of deleted nodes.
44. Minimum betweenness centrality of the neighbors of deleted nodes.
45. Maximum betweenness centrality of the neighbors of deleted nodes.
46. Average closeness centrality of the neighbors of deleted nodes.
47. Minimum closeness centrality of the neighbors of deleted nodes.
48. Maximum closeness centrality of the neighbors of deleted nodes.
49. Average eigenvalues of the neighbors of deleted nodes.
50. Minimum eigenvalues of the neighbors of deleted nodes.
51. Maximum eigenvalues of the neighbors of deleted nodes.
52. Average coreness values of the neighbors of deleted nodes.
53. Minimum coreness values of the neighbors of deleted nodes.
54. Maximum coreness values of the neighbors of deleted nodes.
55. Average hub values of the neighbors of deleted nodes.
56. Minimum hub values of the neighbors of deleted nodes.
57. Maximum hub values of the neighbors of deleted nodes.
58. Average authority values of the neighbors of deleted nodes.
59. Minimum authority values of the neighbors of deleted nodes.
60. Maximum authority values of the neighbors of deleted nodes.
61. Average eccentricity values of the neighbors of deleted nodes.
62. Minimum eccentricity values of the neighbors of deleted nodes.
63. Maximum eccentricity values of the neighbors of deleted nodes.
64. Average number of common neighbors among the neighbors of deleted nodes.
65. Minimum number of common neighbors among the neighbors of deleted nodes.
66. Maximum number of common neighbors among the neighbors of deleted nodes.
67. Average number of common communities with the neighbors of deleted nodes.
68. Minimum number of common communities with the neighbors of deleted nodes.
69. Maximum number of common communities with the neighbors of deleted nodes.
70. Average size of common communities with the neighbors of deleted nodes.
71. Minimum size of common communities with the neighbors of deleted nodes.
72. Maximum size of common communities with the neighbors of deleted nodes.
73. Average incoming similarity with the neighbors of deleted nodes.
74. Minimum incoming similarity with the neighbors of deleted nodes.
75. Maximum incoming similarity with the neighbors of deleted nodes.
76. Average outgoing similarity with the neighbors of deleted nodes.
77. Minimum outgoing similarity with the neighbors of deleted nodes.
78. Maximum outgoing similarity with the neighbors of deleted nodes.
79. Average incoming weights after deletion.
80. Minimum incoming weights after deletion.
81. Maximum incoming weights after deletion.
82. Average outgoing weights after deletion.
83. Minimum outgoing weights after deletion.
84. Maximum outgoing weights after deletion.
85. PageRank of the node after deletion.
86. Transitivity of the node after deletion.
87. Indegree of the node after deletion.
88. Outdegree of the node after deletion.
89. Degree centrality after deletion.
90. Betweenness centrality of the node after deletion.
91. Closeness centrality of the node after deletion.
92. Eigenvalue of the node after deletion.
93. Coreness value of the node after deletion.
94. Hub value of the node after deletion.
95. Authority value of the node after deletion.
96. Eccentricity value of the node after deletion.
97. Number of community memberships of the node after deletion.
98. Average size of community membership of a node after deletion.
99. Minimum size of community membership of a node after deletion.
100. Maximum size of community membership of a node after deletion.
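
As an illustration of how the "before deletion" and "after deletion" features pair up, the sketch below computes a handful of them (incoming-weight statistics, indegree, and outdegree) for one node of a directed, weighted graph. The edge-dictionary representation, the function names, the feature keys, and the convention of reporting 0 for a node with no incoming edges are assumptions made for this example; the paper's own implementation is not shown in this section.

```python
from statistics import mean
from typing import Dict, Set, Tuple

Edge = Tuple[str, str]  # (source, target)


def node_features(weights: Dict[Edge, float], node: str) -> Dict[str, float]:
    """Incoming-weight statistics and degrees of `node` in the given graph."""
    incoming = [w for (src, dst), w in weights.items() if dst == node]
    outgoing = [w for (src, dst), w in weights.items() if src == node]
    return {
        # Assumption: statistics default to 0 when the node has no incoming edge.
        "avg_in_weight": mean(incoming) if incoming else 0.0,
        "min_in_weight": min(incoming, default=0.0),
        "max_in_weight": max(incoming, default=0.0),
        "indegree": len(incoming),
        "outdegree": len(outgoing),
    }


def delete_nodes(weights: Dict[Edge, float], deleted: Set[str]) -> Dict[Edge, float]:
    """Drop every edge that touches a deleted node."""
    return {
        (src, dst): w
        for (src, dst), w in weights.items()
        if src not in deleted and dst not in deleted
    }


def before_after_features(
    weights: Dict[Edge, float], node: str, deleted: Set[str]
) -> Dict[str, float]:
    """One feature row with each measure taken before and after deletion."""
    before = node_features(weights, node)
    after = node_features(delete_nodes(weights, deleted), node)
    row = {f"{name}_before": value for name, value in before.items()}
    row.update({f"{name}_after": value for name, value in after.items()})
    return row


# Toy graph (assumed data): node "d" is deleted, features are taken for "c".
weights = {
    ("a", "c"): 2.0,
    ("b", "c"): 4.0,
    ("c", "d"): 1.0,
    ("d", "c"): 6.0,
}
row = before_after_features(weights, "c", {"d"})
```

In this toy case the average incoming weight of "c" drops from 4.0 to 3.0 and its outdegree from 1 to 0 once "d" is deleted, which is the kind of before/after contrast that features 1 to 22 and 79 to 100 capture.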

About this article

Cite this article

Razavi, A.H., Anggraini, D., Missaoui, R. et al. Modeling and predicting cascading removal phenomenon over social networks. Soc. Netw. Anal. Min. 4, 233 (2014). https://doi.org/10.1007/s13278-014-0233-1

Received: 28 October 2013
Revised: 03 October 2014
Accepted: 07 October 2014
Published: 28 October 2014
DOI: https://doi.org/10.1007/s13278-014-0233-1
