skip to main content
10.1145/1099554.1099708acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Compact reachability labeling for graph-structured data

Published: 31 October 2005 Publication History

Abstract

Testing reachability between nodes in a graph is a well-known problem with many important applications, including knowledge representation, program analysis, and more recently, biological and ontology databases inferencing as well as XML query processing. Various approaches have been proposed to encode graph reachability information using node labeling schemes, but most existing schemes only work well for specific types of graphs. In this paper, we propose a novel approach, HLSS(Hierarchical Labeling of Sub-Structures), which identifies different types of substructures within a graph and encodes them using techniques suitable to the characteristics of each of them. We implement HLSS with an efficient two-phase algorithm, where the first phase identifies and encodes strongly connected components as well as tree substructures, and the second phase encodes the remaining reachability relationships by compressing dense rectangular submatrices in the transitive closure matrix. For the important subproblem of finding densest submatrices, we demonstrate the hardness of the problem and propose several practical algorithms. Experiments show that HLSS handles different types of graphs well, while existing approaches fall prey to graphs with substructures they are not designed to handle.

References

[1]
R. Agrawal, A. Borgida, and H. V. Jagadish. Efficient management of transitive relationships in large data and knowledge bases. In Proc. of the 1989 ACM SIGMOD Intl. Conf. on Management of Data, 1989.
[2]
H. Ait-Kaci, R. Boyer, P. Lincoln, and R. Nasr. Efficient implementation of lattice operations. ACM Trans. Program. Lang. Syst., 11(1):115--146, 1989.
[3]
S. Al-Khalifa, H. V. Jagadish, J. M. Patel, Y. Wu, N. Koudas, and D. Srivastava. Structural joins: A primitive for efficient XML query pattern matching. In Proc. of the 2002 Intl. Conf. on Data Engineering, pages 141--154, 2002.
[4]
N. Bruno, N. Koudas, and D. Srivastava. Holistic twig joins: optimal XML pattern matching. In Proc. of the 2002 ACM SIGMOD, 2002.
[5]
Y. Caseau. Efficient handling of multiple inheritance hierarchies. In Proc. of the 1993 Conf. on Object-oriented Programming Systems, Languages, and Applications, 1993.
[6]
Online Computer Library Center. Dewey decimal classification. http://www.oclc.org/dewey/.
[7]
Q. Chen, A. Lim, and K. W. Ong. D(k)-index: An adaptive structural summary for graph-structured data. In Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data, June 2003.
[8]
V. Christophides, D. Plexousakis, and et al. On labeling schemes for the semantic web. In Proc. of the 12th Intl. Conf. on WWW, 2003.
[9]
E. Cohen, E. Halperin, H. Kaplan, and U. Zwick. Reachability and distance queries via 2-hop labels. In Proc. of the 13th ACM-SIAM SODA, 2002.
[10]
P. F. Dietz. Maintaining order in a linked list. In Proceedings of the 14th ACM symposium on Theory of computing, 1982.
[11]
Frank Olken et al. The Biopathways Graph Data Manager project. http://pueblo.lbl.gov/~olken/graphdm/ graphdm.htm.
[12]
H. He, H. Wang, J. Yang, and Philip S. Yu. Compact reachability labeling for graph-structured data. Technical report, Duke University, Nov. 2004. http://www.cs.duke.edu/~haohe/research/report.pdf.
[13]
T. Kameda. On the vector representation of the reachability in planar directed graphs. Information Processing Letters, 3(3), January 1975.
[14]
M. Katz, N. A. Katz, and et al. Labeling schemes for flow and connectivity. In Proc. of the 13th ACM-SIAM SODA, 2002.
[15]
R. Kaushik, P. Bohannon, J. F. Naughton, and H. F. Korth. Covering indexes for branching path queries. In Proc. of the 2002 ACM SIGMOD Intl. Conf. on Management of Data, June 2002.
[16]
N. Polyzotis and M. N. Garofalakis. Structure and value synopses for XML data graphs. In Proc. of the 28th VLDB, pages 466--477, 2002.
[17]
A. Sayed and R. Unland. Hid: An efficient path index for complex xml collections with arbitrary links. In DNIS, pages 78--91, 2005.
[18]
SRI. The BioCyc project. http://biocyc.org/.
[19]
Z. Vagena, M. Moro, and V. Tsotras. Twig query processing over graph-structured XML data. In Proc. of the 7th Intl. Workshop on Web and Databases, 2004.
[20]
C. Zhang, J. Naughton, and et al. On supporting containment queries in relational database management systems. In Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of Data, 2001.

Cited By

View all
  • (2023)IFCA: Index-Free Community-Aware Reachability Processing Over Large Dynamic Graphs2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00172(2220-2234)Online publication date: Apr-2023
  • (2022)Providing Fast Reachability Query Services With MGTag: A Multi-Dimensional Graph Labeling MethodIEEE Transactions on Services Computing10.1109/TSC.2020.296989815:2(1000-1011)Online publication date: 1-Mar-2022
  • (2020)Ease.ml/snoopy in actionProceedings of the VLDB Endowment10.14778/3415478.341548813:12(2837-2840)Online publication date: 14-Sep-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management
October 2005
854 pages
ISBN:1595931406
DOI:10.1145/1099554
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 October 2005

Permissions

Request permissions for this article.

Check for updates

Author Tag

  1. graph reachability labeling

Qualifiers

  • Article

Conference

CIKM05
Sponsor:
CIKM05: Conference on Information and Knowledge Management
October 31 - November 5, 2005
Bremen, Germany

Acceptance Rates

CIKM '05 Paper Acceptance Rate 77 of 425 submissions, 18%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)IFCA: Index-Free Community-Aware Reachability Processing Over Large Dynamic Graphs2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00172(2220-2234)Online publication date: Apr-2023
  • (2022)Providing Fast Reachability Query Services With MGTag: A Multi-Dimensional Graph Labeling MethodIEEE Transactions on Services Computing10.1109/TSC.2020.296989815:2(1000-1011)Online publication date: 1-Mar-2022
  • (2020)Ease.ml/snoopy in actionProceedings of the VLDB Endowment10.14778/3415478.341548813:12(2837-2840)Online publication date: 14-Sep-2020
  • (2020)COUNTATAProceedings of the VLDB Endowment10.14778/3415478.341548613:12(2829-2832)Online publication date: 14-Sep-2020
  • (2020)IMOProceedings of the VLDB Endowment10.14778/3415478.341548513:12(2825-2828)Online publication date: 14-Sep-2020
  • (2020)CheetahVISProceedings of the VLDB Endowment10.14778/3415478.341548013:12(2805-2808)Online publication date: 1-Aug-2020
  • (2020)Collecting and analyzing data jointly from multiple services under local differential privacyProceedings of the VLDB Endowment10.14778/3407790.340785913:12(2760-2772)Online publication date: 14-Sep-2020
  • (2017)An Indexing Framework for Queries on Probabilistic GraphsACM Transactions on Database Systems10.1145/304471342:2(1-34)Online publication date: 10-May-2017
  • (2017)HD-GDDWorld Wide Web10.1007/s11280-016-0407-z20:4(677-696)Online publication date: 1-Jul-2017
  • (2016)Efficient Subgraph Matching in Large Graph with Partitioning Scheme2016 13th Web Information Systems and Applications Conference (WISA)10.1109/WISA.2016.16(28-33)Online publication date: Sep-2016
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media