Efficient Directed Densest Subgraph Discovery

Published: 17 June 2021

Abstract

Given a directed graph G, the directed densest subgraph (DDS) problem is to find a subgraph of G whose density is the highest among all subgraphs of G. The DDS problem is fundamental to a wide range of applications, such as fraud detection, community mining, and graph compression. However, existing DDS solutions suffer from efficiency and scalability problems: on a three-thousand-edge graph, one of the best exact algorithms takes three days to complete. In this paper, we develop an efficient and scalable DDS solution. We introduce the notion of the [x, y]-core, which is a dense subgraph of G, and show that the densest subgraph can be accurately located through the [x, y]-core with theoretical guarantees. Based on the [x, y]-core, we develop both exact and approximation algorithms. We have performed an extensive evaluation of our approaches on eight large real-world datasets. The results show that our proposed solutions are up to six orders of magnitude faster than the state of the art.
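The abstract does not spell out the density measure or the [x, y]-core definition, so the following is only an illustrative sketch under two assumptions. First, density is taken to be the measure commonly used for directed graphs: for vertex sets S and T, the number of edges from S to T divided by sqrt(|S| * |T|). Second, an [x, y]-core is taken to be the largest pair (S, T) in which every vertex of S has at least x out-edges into T and every vertex of T has at least y in-edges from S. The function names `directed_density` and `xy_core` are hypothetical, and the simple peeling loop is not the paper's algorithm.

```python
from math import sqrt


def directed_density(edges, S, T):
    """Assumed density: |E(S, T)| / sqrt(|S| * |T|)."""
    S, T = set(S), set(T)
    if not S or not T:
        return 0.0
    # Count edges that start in S and end in T.
    e = sum(1 for u, v in edges if u in S and v in T)
    return e / sqrt(len(S) * len(T))


def xy_core(edges, x, y):
    """Naive peeling for the assumed [x, y]-core definition.

    Repeatedly drops sources with fewer than x surviving out-edges and
    targets with fewer than y surviving in-edges until nothing changes.
    """
    edges = set(edges)
    S = {u for u, _ in edges}   # candidate source side
    T = {v for _, v in edges}   # candidate target side
    changed = True
    while changed:
        out_deg = {u: 0 for u in S}
        in_deg = {v: 0 for v in T}
        for u, v in edges:
            if u in S and v in T:
                out_deg[u] += 1
                in_deg[v] += 1
        drop_S = {u for u in S if out_deg[u] < x}
        drop_T = {v for v in T if in_deg[v] < y}
        changed = bool(drop_S or drop_T)
        S -= drop_S
        T -= drop_T
    return S, T


# Toy graph: a complete 2-by-2 block {a, b} -> {c, d} plus a stray edge e -> f.
toy_edges = [("a", "c"), ("a", "d"), ("b", "c"), ("b", "d"), ("e", "f")]
core_S, core_T = xy_core(toy_edges, 2, 2)
whole = directed_density(toy_edges, {"a", "b", "e"}, {"c", "d", "f"})
core = directed_density(toy_edges, core_S, core_T)
print(core_S, core_T, whole, core)
```

On this toy graph, peeling removes the stray edge and leaves the 2-by-2 block, whose assumed density (4 / sqrt(4) = 2) exceeds that of the whole graph (5 / sqrt(9)). This illustrates, on one example only, why a core of this kind is a natural place to look for a dense subgraph.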

