Abstract
As one of the significant issues in social networks analysis, the influence maximization problem aims to fetch a minimal set of the most influential individuals in the network to maximize the number of influenced nodes under a diffusion model. Several approaches have been proposed to tackle this NP-hard problem. The traditional approaches failed to develop an efficient and effective solution due to the exponential growth of the size of social networks (due to massive computational overhead). In this paper, a three-stage framework based on the community detection approach is devised, namely LGFIM. In the first stage, the search space was controlled by partitioning the network into communities. Simultaneously, three heuristic methods were presented for modifying the community detection algorithm to extract the optimal communities: core nodes selection, capacity constraint on communities, and communities combination. These extracted communities were highly compatible with the information propagation mechanism. The next stages apply a scalable and robust algorithm at two different levels of the network: 1. Exploring the local scope of communities to select the most influential nodes of each community and construct the potential influential nodes set 2. Exploring the global scope of the network to select the target influential nodes among potential influential nodes set. Experimental results on various real datasets proved that LGFIM could achieve remarkable results compared with the state-of-the-art algorithms, especially acceptable influence spread, much better running time, and more applicable to massive social networks.










Similar content being viewed by others
Availability of data and materials
All of the mentioned datasets are available at the SNAP library on the Stanford University website https://snap.stanford.edu/data/ or http://konect.cc/networks/ website.
Code availability
These codes are available at https://github.com/mohsentaherinia/LGFIM.
References
Bohra N, Bhatnagar V (2021) Group level social media popularity prediction by MRGB and Adam optimization. J Comb Optim 41(2):328–347
Sangwan N, Bhatnagar V (2021) Video popularity prediction using stacked bilstm layers. Malays J Comput Sci 34(3):242–254
Malik S, Bansal P (2022) Multimodal semantic analysis with regularized semantic autoencoder. J Intell Fuzzy Syst 42:909–917
Banerjee S, Jenamani M, Pratihar DK (2020) A survey on influence maximization in a social network. Knowl Inf Syst 62:3417–3455
Li Y, Fan J, Wang Y, Tan K-LL (2018) Influence maximization on social graphs: a survey. IEEE Trans Knowl Data Eng 30:1852–1872
Domingos P, Richardson M (2001) Mining the network value of customers. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 57–66. Association for Computing Machinery (ACM)
Kempe D, Kleinberg J, Tardose (2003) Maximizing the spread of influence through a social network. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Sumith N, Annappa B, Bhattacharya S (2018) Influence maximization in large social networks: Heuristics, models and parameters. Future Gener Comput Syst 89:777–790
Peng S, Yu S, Mueller P (2018) Social networking big data: opportunities, solutions, and challenges. Future Gener Comput Syst 86:1456–1458
Han M, Li Y (2018) Influence analysis: a survey of the state-of-the-art. Math Found Comput 1(3):201–253
Wang Z, Sun C, Xi J, Li X (2021) Influence maximization in social graphs based on community structure and node coverage gain. Future Gener Comput Syst 118:327–338
He Q, Wang X, Mao F, Lv J, Cai Y, Huang M, Xu Q (2020) CAOM: a community-based approach to tackle opinion maximization for social networks. Inf Sci 513:252–269
Huang H, Shen H, Meng Z (2020) Community-based influence maximization in attributed networks. Appl Intell 50:354–364
Beni HA, Bouyer A (2020) TI-SC: top-k influential nodes selection based on community detection and scoring criteria in social networks. J Ambient Intell Humaniz Comput 1:3
Zhang Z, Li X, Gan C (2020) Identifying influential nodes in social networks via community structure and influence distribution difference. Digit Commun Netw
Li J, Cai T, Deng K, Wang X, Sellis T, Xia F (2020) Community-diversified influence maximization in social networks. Inf Syst 92:101522
Atif Y, Al-Falahi K, Wangchuk T, Lindström B (2020) A fuzzy logic approach to influence maximization in social networks. J Ambient Intell Hum Comput 11:2435–2451
Banerjee S, Jenamani M, Pratihar DK (2019) ComBIM: a community-based solution approach for the Budgeted Influence Maximization Problem. Expert Syst Appl 125:1–13
Huang H, Shen H, Meng Z, Chang H, He H (2019) Community-based influence maximization for viral marketing. Appl Intell 49:2137–2150
Singh SS, Kumar A, Singh K, Biswas B (2019) C2IM: community based context-aware influence maximization in social networks. Physica A 514:796–818
Liqing Q, Wei J, Jinfeng Y, Xin F, Shuang Z (2019) PHG: a three-phase algorithm for influence maximization based on community structure. IEEE Access 1
Li X, Cheng X, Su S, Sun C (2018) Community-based seeds selection algorithm for location aware influence maximization. Neurocomputing 275:1601–1613
Ko YY, Cho KJ, Kim SW (2018) Efficient and effective influence maximization in social networks: a hybrid-approach. Inf Sci 465:144–161
Ye F, Liu J, Chen C, Ling G, Zheng Z, Zhou Y (2018) Identifying influential individuals on large-scale social networks: a community based approach. IEEE Access 6:47240–47257
Jalayer M, Azheian M, Kermani Agha Mohammad Ali M (2018) A hybrid algorithm based on community detection and multi attribute decision making for influence maximization. Comput Ind Eng 120:234–250
Shang J, Zhou S, Li X, Liu L, Wu H (2017) CoFIM: a community-based framework for influence maximization on large-scale networks. Knowl-Based Syst 117:88–100
Bozorgi A, Samet S, Kwisthout J, Wareham T (2017) Community-based influence maximization in social networks under a competitive linear threshold model. Knowl-Based Syst 134:149–158
Hosseini-Pozveh M, Zamanifar K, Naghsh-Nilchi AR (2017) A community-based approach to identify the most influential nodes in social networks. J Inf Sci 43:204–220
Halappanavar M, Sathanur AV, Nandi AK (2016) Accelerating the mining of influential nodes in complex networks through community detection. In: 2016 ACM International Conference on Computing Frontiers - Proceedings, New York, New York, USA, pp 64–71. Association for Computing Machinery, Inc
Bozorgi A, Haghighi H, Sadegh Zahedi M, Rezvani M (2016) INCIM: a community-based algorithm for influence maximization problem under the linear threshold model. Inf Process Manag 52:1188–1199
Jaouadi M, Ben Romdhane L (2016) DIN: an efficient algorithm for detecting influential nodes in social graphs using network structure and attributes. In: Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA. IEEE Computer Society
Zhao Y, Li S, Jin F (2016) Identification of influential nodes in social networks with community structure based on label propagation. Neurocomputing 210:34–44
Rahimkhani K, Aleahmad A, Rahgozar M, Moeini A (2015) A fast algorithm for finding most influential people based on the linear threshold model. Expert Syst Appl 42:1353–1361
Li H, Bhowmick SS, Sun A, Cui J (2015) Conformity-aware influence maximization in online social networks. VLDB J 24:117–141
Chen YC, Zhu WY, Peng WC, Lee WC, Lee SY (2014) CIM: community-based influence maximization in social networks. ACM Trans Intell Syst Technol 5:1–31
Wang Y, Cong G, Song G, Xie K (2010) Community-based greedy algorithm for mining top-K influential nodes in mobile social networks. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD ’10, New York, New York, USA, p 1039. ACM Press
Fortunato S, Barthelemy M (2006) Resolution limit in community detection. Proc Natl Acad Sci USA 104:36–41
Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E Stat Nonlinear Soft Matter Phys 76:036106
Leung IXY, Hui P, Lio P, Crowcroft J (2008) Towards real-time community detection in large networks. Phys Rev Stat Nonlinear Soft Matter Phys 79:066107
Barber MJ, Clark JW (2009) Detecting network communities by propagating labels under constraints. Phys Rev Stat Nonlinear Soft Matter Phys 80:026129
Šubelj L, Bajec M (2011) Unfolding communities in large complex networks: combining defensive and offensive label propagation for core extraction. Phys Rev E Stat Nonlinear Soft Matter Phys 83:036103
Rezaei A, Far SM, Soleymani M (2015) Near linear-time community detection in networks with hardly detectable community structure. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015, pp 65–72. Association for Computing Machinery, Inc
Clauset A, Newman MEJ, Moore C (2004) Finding community structure in very large networks. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Top 70:6
Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008:P10008
Rosvall M, Bergstrom CT (2007) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci USA 105:1118–1123
Xavier CR , Da Fonseca Vieira V, Evsukoff AG (2019) Studying the effect of community structure for seed selection in an influence model. Int J Inf Decis Sci 11(4):300–319
Leskovec J, Krause A, Guestrin C, Faloutsos C, Vanbriesen J, Glance N (2007) Cost-effective outbreak detection in networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, New York, USA, pp 420–429. ACM Press
Goyal A, Lu W, Lakshmanan L. V. (2011) CELF++: optimizing the greedy algorithm for influence maximization in social networks. In: Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011, pp 47–48
Tang Y, Xiao X, Shi Y (2014) Influence maximization: Near-optimal time complexity meets practical efficiency. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, New York, New York, USA, pp 75–86. Association for Computing Machinery
Tang Y, Shi Y, Xiao X (2015) Influence maximization in near-linear time: a martingale approach. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, vol. 2015-May, New York, New York, USA, pp 1539–1554. Association for Computing Machinery
Chen W, Wang Y, Yang S (2009) Efficient influence maximization in social networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, New York, USA, pp 199–207. ACM Press
Cheng S, Shen H, Huang J, Zhang G, Cheng X (2013) StaticGreedy: solving the scalability-accuracy dilemma in influence maximization. In: International Conference on Information and Knowledge Management, Proceedings, New York, New York, USA, pp 509–518. ACM Press
Peng S, Zhou Y, Cao L, Yu S, Niu J, Jia W (2018) Influence analysis in social networks: a survey. J Netw Comput Appl 106:17–32
Page L, Page L, Brin S, Motwani R, Winograd T (1998) The PageRank citation ranking: bringing order to the web
Kleinberg JM (1999) Authoritative sources in a hyperlinked environment. J ACM 46(5):604–632
Kimura M, Saito K, Nakano R, Motoda H (2010) Extracting influential nodes on a social network for information diffusion. Data Min Knowl Discov 20:70–97
Wang C, Chen W, Wang, Y (2012) Scalable influence maximization for independent cascade model in large-scale social networks. In: Data Mining and Knowledge Discovery, vol. 25, pp 545–576. Springer
Chen W, Wang C, Wang Y (2010) Scalable influence maximization for prevalent viral marketing in large-scale social networks. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, New York, USA, pp 1029–1038. ACM Press
Kim J, Kim SK, Yu H (2013) Scalable and parallelizable processing of influence maximization for large-scale social networks? In: Proceedings of International Conference on Data Engineering, pp 266–277
Galhotra S, Arora A, Roy S (2016) Holistic influence maximization: combining scalability and efficiency with opinion-aware models. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, vol. 26-June-20, pp 743–758
Goyal A, Lu W, Lakshmanan LV (2011) SIMPATH: an efficient algorithm for influence maximization under the linear threshold model. In: 2011 IEEE 11th International Conference on Data Mining, pp 211–220. IEEE
Chen W, Yuan Y, Zhang L (2010) Scalable influence maximization in social networks under the linear threshold model. In: Proceedings of IEEE International Conference on Data Mining, ICDM, pp 88–97
Jung K, Heo W, Chen W (2012) IRIE: scalable and robust influence maximization in social networks. In: Proceedings of IEEE International Conference on Data Mining, ICDM, pp 918–923
Liu Q, Xiang B, Chen E, Xiong H, Tang F, Yu JX (2014) Influence maximization over large-scale social networks: a bounded linear approach. In: CIKM 2014—Proceedings of the 2014 ACM International Conference on Information and Knowledge Management, New York, New York, USA, pp 171–180.Association for Computing Machinery, Inc
Cheng S, Shen H-W, Huang J, Chen W, Cheng X-Q (2014) IMRank: influence maximization via finding self-consistent ranking. In: SIGIR 2014 - Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 475–484
Bahutair M, Al Aghbari Z, Kamel I (2021) NodeRank: finding influential nodes in social networks based on interests. J Supercomput 1–27
Khan BS, Niazi MA (2017) Network community detection: a review and visual survey. arXiv
Xie J, Szymanski BK, Liu X (2011) SLPA: uncovering overlapping communities in social networks via a speaker-listener interaction dynamic process.’ In: Proceedings - IEEE International Conference on Data Mining, ICDM, pp 344–349
Khomami MMD, Rezvanian A, Meybodi MR, Bagheri A (2020) CFIN: a community-based algorithm for finding influential nodes in complex social networks. J Supercomput 1–30
Samir AM, Rady S, Gharib TF (2021) LKG: a fast scalable community-based approach for influence maximization problem in social networks. Physica A 582:126258
Xie X, Li J, Sheng Y, Wang W, Yang W (2021) Competitive influence maximization considering inactive nodes and community homophily. Knowl-Based Syst 233:107497
Ahmad A, Ahmad T, Bhatt A (2020) HWSMCB: a community-based hybrid approach for identifying influential nodes in the social network. Physica A 545:123590
Amiri B, Fathian M, Asaadi E (2021) Influence maximization in complex social networks based on community structure. J Ind Syst Eng 13(3):16–40
Newman MEJ (2006) Modularity and community structure in networks. Proc Natl Acad Sci USA 103:8577–8582
Douglas JF (1997) Random walks and random environments, vol. 2, random environments. J Stat Phys 87:961–962
Ahn Y-Y, Bagrow JP, Lehmann S (2009) Link communities reveal multiscale complexity in networks. Nature 466:761–764
Palla G, Derenyi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society
Scripps J, Tan PN, Esfahanian AH (2007) Exploration of link structure and community-based node roles in network analysis. In: Proceedings of IEEE International Conference on Data Mining, ICDM, pp 649–654. IEEE
Panagopoulosécole G, Malliaros FD, Vazirgiannisécole M (2020) Influence maximization using influence and susceptibility embeddings. Technical Report
Leskovec J, Sosič R (2016) SNAP: a general-purpose network analysis and graph-mining library. ACM Trans Intell Syst Technol (TIST) 8(1):1
Kunegis J (2013) The Koblenz Network Collection (KONECT)
Leskovec J, Krevl A (2014) Snap datasets, s.n.a.p. and stanford . Acessed Oct 25 2020
Haque M, Banerjee DS (2019) Accelerating influence maximization using heterogeneous algorithms. J Supercomput 76:4747–4769
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. The authors would like to thank the anonymous reviewers for their valuable comments that helped to improve the quality of this manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Taherinia, M., Esmaeili, M. & Minaei-Bidgoli, B. A high-performance algorithm for finding influential nodes in large-scale social networks. J Supercomput 78, 15905–15952 (2022). https://doi.org/10.1007/s11227-022-04418-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-022-04418-2