Abstract
Software systems can be represented as complex networks and their artificial nature can be investigated with approaches developed in network analysis. Influence maximization has been successfully applied on software networks to identify the important nodes that have the maximum influence on the other parts. However, research is open to study the effects of network fabric on the influence behavior of the highly influential nodes. In this paper, we construct class dependence graph (CDG) networks based on eight practical Java software systems, and apply the procedure of influence maximization to study empirically the correlations between the characteristics of maximum influence and the degree distributions in the software networks. We demonstrate that the artificial nature of CDG networks is reflected partly from the scale free behavior: the in-degree distribution follows power law, and the out-degree distribution is lognormal. For the influence behavior, the expected influence spread of the maximum influence set identified by the greedy method correlates significantly with the degree distributions. In addition, the identified influence set contains influential classes that are complex in both the number of methods and the lines of code (LOC). For the applications in software engineering, the results provide possibilities of new approaches in designing optimization procedures of software systems.
Similar content being viewed by others
References
Myers C R. Software systems as complex networks: structure, function, and evolvability of software collaboration graphs. Phys Rev E, 2003, 68: 046116
Jenkins S, Kirk S R. Software architecture graphs as complex networks: a novel partitioning scheme to measure stability and evolution. Inform Sciences, 2007, 177: 2587–2601
Zheng X, Zeng D, Li H, et al. Analyzing open-source software systems as complex networks. Physica A, 2008, 387: 6190–6200
Cai K Y, Yin B B. Software execution processes as an evolving complex network. Inform Sciences, 2009, 179: 1903–1928
De Moura A P S, Lai Y C, Motter A E. Signatures of small-world and scale-free properties in large computer programs. Phys Rev E, 2003, 68: 017102
Concas G, Marchesi M, Pinna S, et al. Power-laws in a large object-oriented software system. IEEE Trans Softw Eng, 2007, 33: 687–708
Maillart T, Sornette D, Spaeth S, et al. Empirical tests of Zipf’s law mechanism in open source linux distribution. Phys Rev Lett, 2008, 101: 218701
Kohring G A. Complex dependencies in large software systems. Adv Complex Syst, 2009, 12: 565–581
Chepelianskii A D. Towards physical laws for software architecture. arXiv: 1003.5455, 2010
Šubelj L, Bajec M. Community structure of complex software systems: analysis and applications. Physica A, 2011, 390: 2968–2975
LaBelle N, Wallingford E. Inter-package dependency networks in open-source software. arXiv:0411096, 2004
Yang F, Lv J, Mei H. Technical framework for Internetware: an architecture centric approach. Sci China Ser F-Inf Sci, 2008, 51: 610–622
Mei H, Huang G, Lan L, et al. A software architecture centric self-adaptation approach for Internetware. Sci China Ser F-Inf Sci, 2008, 51: 722–742
Lv J, Ma X, Tao X P, et al. On environment-driven software model for Internetware. Sci China Ser F-Inf Sci, 2008, 51: 683–721
Valverde S, Solé R V. Hierarchical small-worlds in software architecture. arXiv:0307278, 2007
Strogatz S H. Exploring complex networks. Nature, 2001, 410: 268–276
Albert R, Barabási A L. Statistical mechanics of complex networks. Rev Mod Phys, 2002, 74: 47–97
Bhattacharya P, Iliofotou M, Neamtiu I, et al. Graph-based analysis and prediction for software evolution. In: Proceedings of the International Conference on Software Engineering, Zurich, 2012. 419–429
Newman M E J. The structure and function of complex networks. SIAM Rev, 2003, 45: 167–256
Concas G, Marchesi M, Pinna S, et al. On the suitability of yule process to stochastically model some properties of object-oriented systems. Physica A, 2006, 370: 817–831
Louridas P, Spinellis D, Vlachos V. Power laws in software. ACM Trans Softw Eng Meth, 2008, 18: 2
Kitsak M, Gallos L K, Havlin S, et al. Identification of influential spreaders in complex networks. Nat Phys, 2010, 6: 888–893
Kimura M, Saito K, Nakano R, et al. Extracting influential nodes on a social network for information diffusion. Data Min Knowl Disc, 2010, 20: 70–97
Lu Z, Zhang W, Wu W, et al. The complexity of influence maximization problem in the deterministic linear threshold model. J Comb Optim, 2012, 24: 374–378
Kempe D, Kleinberg J, Tardos E. Maximizing the spread of influence through a social network. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, 2003. 137–146
Richardson M, Domingos P. Mining knowledge-sharing sites for viral marketing. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, 2002. 61–70
Cosley D, Huttenlocher D P, Kleinberg J M, et al. Sequential influence models in social networks. In: Proceedings of AAAI ICWSM, Washington, 2010. 26–33
Leskovec J, Krause A, Guestrin C, et al. Cost-effective outbreak detection in networks. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, 2007. 420–429
Watts D J. A simple model of global cascades on random networks. Proc Natl Acad Sci, 2002, 99: 5766–5771
Miorandi D, De Pellegrini F. K-Shell decomposition for dynamic complex networks. In: Proceedings of the 8th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks (WiOpt), Avignon, 2010. 488–496
Chen W, Wang Y, Yang S. Efficient influence maximization in social networks. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, 2009. 199–208
Jiang Q, Song G, Cong G, et al. Simulated annealing based influence maximization in social networks. In: Proceedings of AAAI, San Francisco, 2011. 127–132
Wang Y, Cong G, Song G, et al. Community based greedy algorithm for mining top-k influential nodes in mobile social networks. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, 2010. 1039–1048
Canright G S, Engø Monsen K. Spreading on networks: a topographic view. Complexus, 2006, 3: 131–146
Inoue K, Yokomori R, Yamamoto T, et al. Ranking significance of software components based on use relations. IEEE Trans Softw Eng, 2005, 31: 213–225
Vasa R, Schneider J G, Nierstrasz O. The inevitable stability of software change. In: Proceedings of IEEE International Conference on Software Maintenance, Paris, 2007. 413–422
Martin González A M, Dalsgaard B, Olesen J M. Centrality measures and the importance of generalist species in pollination networks. Ecol Complex, 2010, 7: 36–43
Li C T, Shan M K, Lin S D. Dynamic selection of activation targets to boost the influence spread in social networks. In: Proceedings of the 21st International Conference Companion on World Wide Web, Lyon, 2012. 561–562
Zhang Y, Gu Q, Zheng J, et al. Estimate on expectation for influence maximization in social networks. In: Proceedings of the 14th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, Hyderabad, 2010. 99–106
Newman M E J. Power laws, pareto distributions and Zipf’s law. Contemp Phys, 2005, 46: 323–351
Marquardt D W. An algorithm for least-squares estimation of nonlinear parameters. SIAM J Appl Math, 1963, 11: 431–441
Mitzenmacher M. A brief history of generative models for power law and lognormal distributions. Internet Math, 2004, 1: 226–251
Molloy M, Reed B. A critical point for random graphs with a given degree sequence. Random Struct Algor, 1995, 6: 161–180
Myers J L, Well A D. Research Design and Statistical Analysis. Lawrence Erlbaum Associates, 2002
Hall T, Beecham S, Bowes D, et al. A systematic literature review on fault prediction performance in software engineering. IEEE Trans Softw Eng, 2012, 38: 1276–1304
Zhou Y M, Xu B W, Leung H. On the ability of complexity metrics to predict fault-prone classes in object-oriented systems. J Syst Software, 2010, 83: 660–674
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Gu, Q., Xiong, S. & Chen, D. Correlations between characteristics of maximum influence and degree distributions in software networks. Sci. China Inf. Sci. 57, 1–12 (2014). https://doi.org/10.1007/s11432-013-5047-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11432-013-5047-7