skip to main content
10.1145/3448016.3457251acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

MIDAS: Towards Efficient and Effective Maintenance of Canned Patterns in Visual Graph Query Interfaces

Authors Info & Claims
Published:18 June 2021Publication History

ABSTRACT

Several visual graph query interfaces (a.k.a gui) expose a set of canned patterns (i.e., small subgraph patterns) to expedite subgraph query formulation by enabling pattern-at-a-time construction. Unfortunately, manual generation of canned patterns is not only labour intensive but also may lack diversity to support efficient visual formulation of a wide range of subgraph queries. Recent efforts have taken a data-driven approach to select high-quality canned patterns for a gui automatically from the underlying graph database. However, as the underlying database evolves, these selected patterns may become stale and adversely impact efficient query formulation. In this paper, we present a novel framework called Midas for efficient and effective maintenance of the canned patterns as the database evolves. Specifically, it adopts a selective maintenance strategy that guarantees progressive gain of coverage of the patterns without sacrificing their diversity and cognitive load. Experimental study with real-world datasets and visual graph interfaces demonstrates the effectiveness of Midas compared to static guis.

Skip Supplemental Material Section

Supplemental Material

3448016.3457251.mp4

mp4

223.1 MB

References

  1. textscaids dataset. https://wiki.nci.nih.gov/display/NCIDTPdata/AIDSGoogle ScholarGoogle Scholar
  2. AntiviralGoogle ScholarGoogle Scholar
  3. ScreenGoogle ScholarGoogle Scholar
  4. Data.Google ScholarGoogle Scholar
  5. rugBank interface. https://go.drugbank.com/structures/search/small_molecule_drugs/structure.Google ScholarGoogle Scholar
  6. Molecules dataset. https://www.emolecules.com/info/plus/download-database.Google ScholarGoogle Scholar
  7. ubChem dataset. ftp://ftp.ncbi.nlm.nih.gov/pubchem/Compound/CURRENT-Full/SDF/.Google ScholarGoogle Scholar
  8. ubChem interface. https://pubchem.ncbi.nlm.nih.gov//edit3/index.html.Google ScholarGoogle Scholar
  9. . Abdelhamid, M. Canim, M. Sadoghi, B. Bhattacharjee, Y.C. Chang, P. Kalnis. Incremental frequent subgraph mining on large evolving graphs. IEEE T. Knowl. Data En. 29(12):2710--2723, 2017.Google ScholarGoogle ScholarCross RefCross Ref
  10. . Arora, E. Hazan, S. Kale. The multiplicative weights update method: a meta-algorithm and applications. Theory Comput. 8(1), 2012.Google ScholarGoogle Scholar
  11. . Arthur, S. Vassilvitskii. K-meansGoogle ScholarGoogle Scholar
  12. : The advantages of careful seeding. In SIAM, 2007.Google ScholarGoogle Scholar
  13. .L. Balcázar, A. Bifet, A. Lozano. Mining frequent closed rooted trees. Machine Learning, 78(1--2):1, 2010.Google ScholarGoogle Scholar
  14. . Bifet, R. Gavald$gravea$. Mining adaptively frequent closed unlabeled rooted trees in data streams. In SIGKDD, 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. . Bifet, R. Gavald$gravea$. Mining frequent closed trees in evolving data streams. Intell. Data Anal., 15(1):29--48, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. .S. Bhowmick, B. Choi, C.E. Dyreson. Data-driven visual graph query interface construction and maintenance: challenges and opportunities. PVLDB, 9(12):984--992, 2016.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. . S. Bhowmick, K. Huang, et al. AURORA: data-driven construction of visual graph query interfaces for graph databases. In SIGMOD, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. . Bonnici, A. Ferro, R. Giugno, A. Pulvirenti, D. Shasha. Enhancing graph database indexing by suffix tree structure. In IAPR PRIB, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  19. J. Cheng, Y. Ke, W. Ng, et al. Fg-index: towards verification-free query processing on graph databases. In SIGMOD, 2007.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. . Cheung, O.R. Zaiane. Incremental mining of frequent patterns without candidate generation or support constraint. In IDEAS, 2003.Google ScholarGoogle Scholar
  21. .P. Cordella, P. Foggia, et al. A (sub)graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell., 26(10):1367--1372, 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. . Fan, J. Li, S. Ma, N. Tang, Y. Wu and Y. Wu. Graph pattern matching: from intractability to polynomial time. In PVLDB, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. . Fan, J. Li, J. Luo, Z. Tan, X. Wang, Y. Wu. Incremental graph pattern matching. In SIGMOD, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. . Fan, C. Hu, C. Tian. Incremental graph computations: doable and undoable. In SIGMOD, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. .L. Faulkner. Beyond the five-user assumption: Benefits of increased sample sizes in usability testing. Behavior Research Methods, Instruments, & Computers, 35(3), 2003.Google ScholarGoogle Scholar
  26. . He, A.K. Singh. Closure-tree: An index structure for graph queries. In ICDE, 2006.Google ScholarGoogle Scholar
  27. . Huang, H.E. Chua, S.S. Bhowmick, B. Choi, S. Zhou. CATAPULT: data-driven selection of canned patterns for efficient visual graph query formulation. In SIGMOD, 2019.Google ScholarGoogle Scholar
  28. . Huang, H.E. Chua, S.S. Bhowmick, B. Choi, S. Zhou. MIDAS: towards efficient and effective maintenance of canned patterns in visual graph query interfaces. Technical Report. Available at: https://github.com/MIDAS2020/Midas/blob/master/sigmod-midas_TR.pdf.Google ScholarGoogle Scholar
  29. . Huang, P. Eades, S.H. Hong. Measuring effectiveness of graph visualizations: A cognitive load perspective. Inf. Vis., 8(3):139--152, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. . Lazar, J.H. Feng, H. Hochheiser. Research methods in human-computer interaction. John Wiley & Sons, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. . Li, M. Semerci, B. Yener, M.J. Zaki. Graph classification via topological and label attributes. In MLG, 2011.Google ScholarGoogle Scholar
  32. .J. Llanos, J. Leal, W. Luu, D.H. Jost, P.F., Stadler, G. Restrepo. Exploration of the chemical space and its three historical regimes. PNAS, 116(26):12660--12665, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  33. .T. Marler, J.S. Arora. The weighted sum method for multi-objective optimization: new insights. Struct. Multidiscip. O., 41(6):853--862, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  34. . Morina. The trie data structure in Java. Available at https://www.baeldung.com/trie-java. Accessed on 30 September 2019.Google ScholarGoogle Scholar
  35. . Prvzulj. Biological network comparison using graphlet degree distribution. Bioinformatics, 23(2):e177-e183, 2007.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. . Riesen, M. Neuhaus, H. Bunke. Bipartite graph matching for computing the edit distance of graphs. In GbRPR, 2007.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. . Saha, L. Getoor. On maximum coverage in the streaming model & application to multi-topic blog-watch. In SDM, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  38. . Saha. An incremental bisimulation algorithm. In FSTTCS, 2007.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. . Shang, X. Lin, Y. Zhang, J.X. Yu, W. Wang. Connected substructure similarity search. In SIGMOD, 2010.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. . Shneiderman, C. Plaisant. Desigining the user interface: strategies for effective human-computer interaction. 5th Ed., Addison-Wesley, 2010.Google ScholarGoogle Scholar
  41. . Tofallis. Add or multiply? A tutorial on ranking and choosing with multiple criteria. INFORMS Trans. on Education, 14(3): 109--119, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  42. . Wang, W. Hsu, M.L. Lee, C. Sheng. A partition-based approach to graph mining. In ICDE, 2006.Google ScholarGoogle Scholar
  43. . Yan, P. S. Yu, J. Han. Graph indexing: a frequent structure-based approach. In SIGMOD, 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. . Yang, A.W.C. Fu, R. Liu. Diversified top-k subgraph querying in a large graph. In SIGMOD, 2016.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. . Yoghourdjian, D. W. Archambault, et al.Exploring the limits of complexity: A survey of empirical studies on graph visualisation. Visual Informatics ,2(4): 264--282, 2018.Google ScholarGoogle ScholarCross RefCross Ref
  46. . Yuan, P. Mitra, H. Yu, C.L. Giles. Iterative graph feature mining for graph indexing. In ICDE, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. . Yuan, P. Mitra, H. Yu, C.L. Giles. Updating graph indices with a one-pass algorithm. In SIGMOD, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. . Zhang, S.S. Bhowmick, H.H. Nguyen, B. Choi, F. Zhu. DaVinci: Data-driven visual interface construction for subgraph search in graph databases. In ICDE, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  49. . Zhang, M. Hu, J. Yang. Treepi: A novel graph indexing method. In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  50. . Zhao, J.X. Yu, P.S. Yu. Graph indexing: treeGoogle ScholarGoogle Scholar
  51. delta= graph. In VLDB, 2007.Google ScholarGoogle Scholar
  52. . Zou, L. Chen, J.X. Yu, Y. Lu. A novel spectral coding in a large graph database. In EDBT, 181--192, 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. MIDAS: Towards Efficient and Effective Maintenance of Canned Patterns in Visual Graph Query Interfaces

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '21: Proceedings of the 2021 International Conference on Management of Data
      June 2021
      2969 pages
      ISBN:9781450383431
      DOI:10.1145/3448016

      Copyright © 2021 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 June 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader