ABSTRACT
As the world is flooded with deluge of data, the demand for mining data to gain insights is increasing. One effective technique to deal with the problem is to model the data as networks (graphs) and then apply graph mining techniques to uncover useful patterns. Several graph mining techniques have been studied in the literature, and graphlet-based analysis is gaining popularity due to its power in exposing hidden structure and interaction within the networks.
The concept of graphlets for basic (undirected) networks was introduced around 2004 by Pržulj, et. al. [14]. Subsequently, graphlet based network analysis gained attraction when Pržulj added the concept of graphlet orbits and applied to biological networks [15]. A decade later, Sarajlić, et. al. introduced graphlets and graphlet orbits for directed networks, illustrating its application to fields beyond biology such as world trade networks, brain networks, communication networks, etc. [19]. Hence, directed graphlets are found to be more powerful in exposing hidden structures of the network than undirected graphlets of same size, due to added information on the edges. Taking this approach further, more recently, graphlets and orbits for signed networks have been introduced by Dale [3]. This paper presents a simple algorithm to enumerate signed graphlets and orbits. It then demonstrates an application of signed graphlets and orbits to a metabolic network.
- Aparicio, D., Robeiro, P., Silva, F.: Extending the applicability of graphlets to directed networks, IEEE ACM Transactions on Computational Biology and Bioinformatics, 14(6):1302-1315, 2017.Google ScholarDigital Library
- Becchetti, L., Boldi, P., Castillo, C., Gionis, A.: Efficient semi-streaming algorithms for local triangle counting in massive graphs, Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining, 16--24, 2008.Google ScholarDigital Library
- Dale, M.: Applying Graph Theory in Ecological Research, Cambridge University Press, 2017.Google ScholarCross Ref
- Davis, D., Yaveroğlu, Ö. N., Malod-Dognin, N., Stojmirovic, A., Pržulj, N.: Topology-function conservation in protein-protein interaction networks, Bioinformatics, 31(10):1632-1639, 2015.Google Scholar
- Holland, P. W., Leinhardt, S.: Local structure in social networks, Sociological Methodology, 7:1-45, 1976.Google ScholarCross Ref
- Kolountzakis, M. N., Miller, G. L., Peng, R., Tsourakakis, C. E.: Efficient triangle counting in large graphs via degree-based vertex partitioning, Internet Mathematics, 8(1-2):161-185, 2012.Google ScholarCross Ref
- Li, W., Schuurmans, D.: Modular community detection in networks, Proc. of the 22nd Intl. Conf. on Artificial Intelligence, 1366--1371, 2011.Google Scholar
- Milenković, T., Lai, J., Pržulj, N.: Graphcrunch: a tool for large network analyses, BMC Bioinformatics, 9(70):1-11, 2008.Google Scholar
- Milenković, T., Memišević, V., Ganesan, A. K., Pržulj, N.: Systems-level cancer gene identification from protein interaction network topology applied to melanogenesis-related functional genomics data, Journal of the Royal Society Interface, 7(44):423-437, 2009.Google ScholarCross Ref
- Milenković, T., and Pržulj, N.: Uncovering biological network function via graphlet degree signatures, Cancer Informatics, 6:1-10, 2008.Google ScholarCross Ref
- Marcus, D., Shavitt, Y.: RAGE-a rapid graphlet enumerator for large networks, Computer Networks, 56(2):810-819, 2012.Google ScholarDigital Library
- Pagh R., Tsourakakis, C.: Colorful triangle counting and a MapReduce implementation, Information Processing Letters, 112(7):277-281, 2012.Google ScholarDigital Library
- Pearl, J., Glymour, M., Jewell, N. P.: Causal Inference in Statistics, Wiley, 2016.Google Scholar
- Pržulj, N., Corneil, D. G., Jurisica, I.: Modeling : scale-free or geometric? Bioinformatics, 20(18):3508-3515, 2004.Google ScholarDigital Library
- Pržulj, N: Biological network comparison using graphlet degree distribution, Bioinformatics, 23(2):177-183, 2007.Google ScholarDigital Library
- Rossi, R. A., Ahmed, N. K.: The network data repository with interactive graph analytics and visualization, Proceedings of the 29th AAAI Conference on Artificial Intelligence, 4292--4293, 2015.Google Scholar
- Leskovec, Jure and Sosič, Rok: SNAP: A general-purpose network analysis and graph-mining library, ACM Transactions on Intelligent Systems and Technology (TIST) 8(1):1:1-20 (2016).Google Scholar
- Schank, T.: Algorithmic Aspects of Triangle-Based Network Analysis, PhD Thesis, Universitiat Karlsruhe, 2007.Google Scholar
- Sarajlić, A., Malod-Dognin, N., Yaveroğlu, Ö. N., Pržulj, N.: Graphlet-based characterization of directed networks, Nature Scientific Reports, 6(35098):1-14, 2016.Google Scholar
- Sonmez, A. B., and Can, T.: Comparison of tissue/disease specific integrated networks using directed graphlet signatures, BMC bioinformatics, 18(4):41-50, 2017.Google Scholar
- Suri, S., Vassilvitskii, S.: Counting triangles and the curse of the last reducer, Proceedings of the 20th international conference on world wide web, 607--614, 2011.Google ScholarDigital Library
- Schank, T., Wagner, D.: Finding, counting and listing all triangles in large graphs, an experimental study, Proc. of the international workshop on experimental and efficient algorithms, Springer, 606--609, 2005.Google ScholarDigital Library
- Yaveroğlu, N.: Graphlet correlations for network comparison and modelling: World trade network example, Thesis, Imperial College, 2014.Google Scholar
- Yaveroğlu, et. al.: Revealing the hidden language of complex networks, Nature Scientific Reports, 4(4547):1-9, 2014.Google Scholar
- Toubiana, D. et. al.: Correlation-based network analysis of metabolite and enzyme profiles reveals a role of citrate biosynthesis in modulating N and C metabolism in Zea mays, Frontiers in Plant Science, 7(1022):1-10, 2016.Google Scholar
- Algorithm and application for signed graphlets
Recommendations
Heterogeneous Graphlets
In this article, we introduce a generalization of graphlets to heterogeneous networks called typed graphlets. Informally, typed graphlets are small typed induced subgraphs. Typed graphlets generalize graphlets to rich heterogeneous networks as they ...
Extending the Applicability of Graphlets to Directed Networks
With recent advances in high-throughput cell biology, the amount of cellular biological data has grown drastically. Such data is often modeled as graphs also called networks and studying them can lead to new insights into molecule-level organization. A ...
Degree choosable signed graphs
A signed graph is a graph in which each edge is labeled with +1 or 1. A (proper) vertex coloring of a signed graph is a mapping that assigns to each vertex vV(G) a color (v)Z such that every edge vw of G satisfies (v)(vw)(w), where (vw) is the sign of ...
Comments