DOI: 10.1145/1830252.1830267 (ACM conferences, KDD conference proceedings)
research-article

Pruthak: mining and analyzing graph substructures

Published: 24 July 2010

ABSTRACT

In many scientific and commercial domains, the graph as a data structure has become increasingly important for modeling sophisticated structures. In the past few years, there has been a sharp increase in research on mining graph data. We previously proposed a unified framework for graph mining and analysis of extracted substructures, a task that had not been addressed at the time. Pruthak, a graph mining tool, was developed based on this framework. The tool provides preprocessing, frequent substructure discovery, dense substructure extraction, and visualization techniques for graph representations of data. In this paper, we discuss the approach taken in the design and implementation of Pruthak. We then describe our study of the Digital Bibliography & Library Project (DBLP) dataset, in which we mined and analyzed substructures using this tool. The results of the study demonstrate that the tool is correct and usable as intended.
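As a rough illustration of what dense substructure extraction on bibliographic data can look like, the sketch below builds a co-authorship graph from a few made-up paper records and lists its maximal cliques using the basic Bron-Kerbosch recursion. This is not Pruthak's implementation: the abstract does not say which algorithms the tool uses, and the function names, the toy records, and the size threshold of 3 are all assumptions chosen for the example.

```python
from itertools import combinations


def build_coauthor_graph(papers):
    """Map each author to the set of people they have co-authored with."""
    graph = {}
    for authors in papers:
        for a in authors:
            graph.setdefault(a, set())
        for a, b in combinations(set(authors), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph


def maximal_cliques(graph):
    """Enumerate all maximal cliques (basic Bron-Kerbosch, no pivoting)."""
    cliques = []

    def expand(r, p, x):
        # r: current clique, p: candidates to extend it, x: already explored
        if not p and not x:
            cliques.append(frozenset(r))
            return
        for v in list(p):
            expand(r | {v}, p & graph[v], x & graph[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(graph), set())
    return cliques


# Hypothetical author lists, one per paper (not taken from DBLP).
papers = [
    ["A", "B", "C"],
    ["A", "B"],
    ["C", "D"],
    ["E"],
]

graph = build_coauthor_graph(papers)
# Keep only groups of at least three mutually collaborating authors.
dense = [c for c in maximal_cliques(graph) if len(c) >= 3]
print(dense)
```

Maximal clique enumeration is exponential in the worst case, so on a graph the size of the full DBLP co-authorship network a practical tool would likely need pruning, pivoting, or a relaxed density measure; the sketch is meant only to make the idea concrete.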
