Abstract
Introduced in the context of machine learning, the Feature F-measure is a statistical feature selection metric without parameters that allows to describe classes through a set of salient features. It was shown efficient for classification, cluster labeling and clustering model quality measurement. In this paper, we introduce the Node F-measure, its transposition in the context of networks, where it can by analogy be applied to detect salient nodes in communities. This approach benefits from the parameter-free system of Feature F-Measure, its low computational complexity and its well-evaluated performance. Interestingly, we show that in addition to these properties, Node F-measure is correlated with certain centrality measures, and with measures designed to characterize the community roles of nodes. We also observe that the usual community roles measures are strongly dependent from the size of the communities whereas the ones we propose are by definition linked to the density of the community. This hence makes their results comparable from one network to another. Finally, the parameter-free selection process applied to nodes allows for a universal system, contrary to the thresholds previously defined empirically for the establishment of community roles. These results may have applications regarding leadership in scientific communities or when considering temporal monitoring of communities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Barabási, A.L.: Network Science. Cambridge University Press, Cambridge (2016)
Brin, S., Page, L.: Reprint of: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. 56(18), 3825–3833 (2012)
Dugué, N., Labatut, V., Perez, A.: A community role approach to assess social capitalists visibility in the twitter network. Soc. Netw. Anal. Min. 5(1), 1–13 (2015)
Freeman, L.C.: Centrality in social networks conceptual clarification. Soc. Netw. 1(3), 215–239 (1979)
Guimerà, R., Amaral, L.: Functional cartography of complex metabolic networks. Nature 433, 895–900 (2005)
Klimm, F., Borge-Holthoefer, J., Wessel, N., Kurths, J., Zamora-López, G.: Individual node’s contribution to the mesoscale of complex networks. New J. Phys. 16(12), 125006 (2014)
Kunegis, J.: Konect: the koblenz network collection. In: WWW, pp. 1343–1350 (2013)
Lamirel, J.C., Cuxac, P., Chivukula, A., Hajlaoui, K.: Optimizing text classification through efficient feature selection based on quality metric. J. I. IS 45(3), 379–396 (2015)
Lamirel, J.C., Dugué, N., Cuxac, P.: New efficient clustering quality indexes. In: IJCNN (2016)
Lamirel, J.C., Falk, I., Gardent, C.: Federating clustering and cluster labelling capabilities with a single approach based on feature maximization: French verb classes identification with igngf. Neurocomputing 147, 136–146 (2015)
Lancichinetti, A., Fortunato, S., Radicchi, F.: Benchmark graphs for testing community detection algorithms. Phys. Rev. E 78(4), 046110 (2008)
Lancichinetti, A., Kivelä, M., Saramäki, J., Fortunato, S.: Characterizing the community structure of complex networks. PloS one 5(8), e11976 (2010)
Newman, M.E., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. 69(2), 026113 (2004)
Rosvall, M., Bergstrom, C.T.: Maps of random walks on complex networks reveal community structure. Proc. Natl. Acad. Sci. 105(4), 1118–1123 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Dugué, N., Lamirel, JC., Perez, A. (2019). Bringing a Feature Selection Metric from Machine Learning to Complex Networks. In: Aiello, L., Cherifi, C., Cherifi, H., Lambiotte, R., Lió, P., Rocha, L. (eds) Complex Networks and Their Applications VII. COMPLEX NETWORKS 2018. Studies in Computational Intelligence, vol 813. Springer, Cham. https://doi.org/10.1007/978-3-030-05414-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-05414-4_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05413-7
Online ISBN: 978-3-030-05414-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)