Abstract
In the real world, many systems can be represented as a network, in which the nodes denote the objects of interest and the edges describe the relations between them, such as telecommunication networks, power grid networks, and email communication networks. These complex networks have been revealed to possess many common statistical properties such as scale-free nature and small-world property. In addition, modularity or community structure is another important characteristic of complex networks. Identifying modular structure can help us understand the function of networks. In this paper, we introduce a method based on information-theoretic clustering for finding communities/modules in complex networks. This method is robust to the feature representation of networks. Moreover, unlike most existing algorithms, this method does not need to search the number of communities in a network and can determine it automatically. We apply this method to several well-studied networks including a large-scale email communication network and the computational results demonstrate its effectiveness.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Girvan, M., Newman, M.E.J.: Community Structure in Social and Biological Networks. Proc. Natl. Acad. Sci. USA 99, 7821–7826 (2002)
Gibson, D., Kleinberg, J., Raghavan, P.: Inferring Web Communities from Link Topology. In: Proceedings of the 9th ACM Conference on Hypertext and Hypermedia. ACM Press, New York (1998)
Newman, M.E.J., Forrest, S., Balthrop, J.: E-mail Networks and the Spread of Computer Viruses. Phys. Rev. EÂ 66, 035101 (2002)
Ebel, H., Mielsch, L.I., Bornholdt, S.: Scale-free Topology of E-mail Networks. Phys. Rev. EÂ 66, 035103 (2002)
Albert, R., Barabási, A.: Statistical Mechanics of Complex Networks. Reviews of Modern Physics 74, 47–97 (2002)
Boykin, P.O., Roychowdhury, V.P.: Leveraging Social Networks to Fight Spam. Computer 38, 61–68 (2005)
Kong, J.S., Rezaei, B.A., Sarshar, N., Roychowdhury, V.P., Boykin, P.O.: Collaborative Spam Filtering Using E-mail networks. Computer 39, 67–73 (2006)
Guimerà , R., Danon, L., Diaz-Guilera, A., Giralt, E., Arenas, A.: The Real Communication Network Behind the Formal Chart: Community Structure in Organizations. Journal of Economic Behavior & Organization 61, 653–667 (2006)
Wasserman, S., Faust, K.: Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge (1994)
Angelini, L., Boccaletti, S., Marinazzo, D., Pellicoro, M., Stramaglia, S.: Fast Identification of Network Modules by Optimization of Ratio Association. Chaos 17, 023114 (2007)
Pallal, G., Derenyi, I., Farkasl, I., Vicsek, T.: Uncovering the Overlapping Community Structure of Complex Networks in Nature and Society. Nature 435, 814–818 (2005)
Newman, M.E.J.: Modularity and Community Structure in Networks. Proc. Natl Acad. Sci. USA 103(23), 8577–8582 (2006)
Guimerà , R., Amaral, L.A.N.: Functional Cartography of Complex Metabolic Networks. Nature 438, 895–900 (2005)
Newman, M.E.J.: Fast Algorithm for Detecting Community Structure in Networks. Phys. Rev. EÂ 69, 066133 (2004)
Gustafsson, M., Hörnquista, M., Lombardi, A.: Comparison and Validation of Community Structures in Complex Networks. Physica A 367, 559–576 (2006)
Danon, L., Daz-Guilera, A., Duch, J., Arenas, A.: Comparing Community Structure Identification. J. Statist. Mech.: Theory and Experiment 09, P09008 (2005)
Fortunato, S.: Community Detection in Graphs. Physics Reports 486(3-5), 75–174 (2010)
Lancichinetti, A., Fortunato, S.: Community Detection Algorithms: A Comparative Analysis. Phys. Rev. EÂ 80, 056117 (2009)
White, S., Smyth, P.: A Spectral Clustering Approach to Finding Communities in Graphs. In: SIAM International Conference on Data Mining, Newport Beach, CA, USA (2005)
Kondor, R.I., Lafferty, J.: Diffusion Kernels on Graphs and Other Discrete Input Space. In: Proc. ICML 2002, pp. 315–322. Morgan Kaufmann Publishers Inc., San Francisco (2002)
Slonim, N., Atwal, G.S., Tkacik, G., Bialek, W.: Information-based Clustering. Proc. Natl. Acad. Sci. USA 102(51), 18297–18302 (2005)
Newman, M.E.J., Girvan, M.: Finding and Evaluating Community Structure in Networks. Phys. Rev. EÂ 69, 026113 (2004)
Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley, New York (1991)
Guimerà , R., Danon, L., DÃaz-Guilera, A., Giralt, F., Arenas, A.: Self-similar Community Structure in a Network of Human Interactions. Phys. Rev. E 68, 65103 (2003)
Leung, I.X.Y., Hui, P., Liò, P., Crowcroft, J.: Towards Real-time Community Detection in Large Networks. Phys. Rev. E 79, 066107 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, Y., Wang, G. (2010). Structure Analysis of Email Networks by Information-Theoretic Clustering. In: Zhang, L., Lu, BL., Kwok, J. (eds) Advances in Neural Networks - ISNN 2010. ISNN 2010. Lecture Notes in Computer Science, vol 6064. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13318-3_52
Download citation
DOI: https://doi.org/10.1007/978-3-642-13318-3_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13317-6
Online ISBN: 978-3-642-13318-3
eBook Packages: Computer ScienceComputer Science (R0)