Abstract
Shortest distance computation is one of the widely researched areas in theoretical computer science and graph databases. Distance labeling are well-known for improving the performance of shortest distance queries. One of the best distance labeling approaches is Pruned Landmark Labeling (PLL). PLL is a 2-hop distance labeling which prunes a lot of unnecessary labels while doing breadth-first-search. Another well-known 2-hop labeling is Pruned Highway Labeling (PHL) which is designed for undirected road networks. Both PLL and PHL suffer from the problem of large index size. In this paper, we propose two approaches to address the problem, one is to compress the PLL index as well as the graph for directed graphs; the other is to compress undirected road networks using linear sets, which are essentially maximal-length non-branching paths. Our aim is to reduce the index size and index construction time without significantly compromising query performance. Extensive experiments with real world datasets confirm the effectiveness of our approaches.
Similar content being viewed by others
References
Abraham, I., Delling, D., Goldberg, A.V., Werneck, R.F.F.: Hierarchical hub labelings for shortest paths. In: Epstein, L., Ferragina, P. (eds.) Algorithms - ESA 2012 - 20th Annual European Symposium, Ljubljana, Slovenia, September 10-12, 2012. Proceedings, Lecture Notes in Computer Science, vol. 7501, pp 24–35 (2012)
Akiba, T., Iwata, Y., Kawarabayashi, K., Kawata, Y.: Fast shortest-path distance queries on road networks by pruned highway labeling. In: ALENEX, pp 47–154 (2014)
Akiba, T., Iwata, Y., Yoshida, Y.: Fast exact shortest-path distance queries on large networks by pruned landmark labeling. In: SIGMOD. https://doi.org/10.1145/2463676.2465315, pp 349–360 (2013)
Akiba, T., Sommer, C., Kawarabayashi, K.: Shortest-path queries for complex networks: exploiting low tree-width outside the core. In: EDBT. https://doi.org/10.1145/2247596.2247614, pp 144–155 (2012)
Anirban, S., Wang, J., Islam, M.S.: Modular decomposition-based graph compression for fast reachability detection. Data Sci. Eng. 4, 193–207 (2019). https://doi.org/10.1007/s41019-019-00099-9
Chang, L., Yu, J.X., Qin, L., Cheng, H., Qiao, M.: The exact distance to destination in undirected world. VLDB J. 21, 869–888 (2012). https://doi.org/10.1007/s00778-012-0274-x
Chen, W., Sommer, C., Teng, S., Wang, Y.: A compact routing scheme and approximate distance oracle for power-law graphs. TALG 9, 1–26 (2012). https://doi.org/10.1145/2390176.2390180
Cohen, E., Halperin, E., Kaplan, H., Zwick, U.: Reachability and distance queries via 2-hop labels. SIAM, 937–946 (2002)
Delling, D., Goldberg, A, Werneck, R: Hub label compression. In: Proceedings of the 12th International Symposium on Experimental Algorithms (SEA’13). https://www.microsoft.com/en-us/research/publication/hub-label-compression/. Springer Verlag (2013)
Delling, D.: Route planning in transportation networks: from research to practice. In: SIGSPATIAL. https://doi.org/10.1145/3274895.3282802, p 2 (2018)
Fan, W.: Data quality: Theory and practice. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds.) Web-Age Information Management - 13th International Conference, WAIM 2012, Harbin, China, August 18-20. Proceedings, Lecture Notes in Computer Science, vol. 7418, pp 1–16 (2012)
Farhan, M., Wang, Q., Lin, Y., McKay, B.: A highly scalable labelling approach for exact distance queries in complex networks. In: EDBT, pp 13–24 (2019)
Farhan, M., Wang, Q., Lin, Y., McKay, B.D.: A highly scalable labelling approach for exact distance queries in complex networks. In: Advances in Database Technology - 22nd International Conference on Extending Database Technology, EDBT 2019, Lisbon, Portugal, March 26-29, 2019, pp 13–24 (2019)
chee Fu, A.W., Wu, H., Cheng, J., Wong, R.C.W.: Is-label: an independent-set based labeling scheme for point-to-point distance querying. In: PVLDB. https://doi.org/10.14778/2536336.2536346, vol. 6, pp 457–468 (2013)
Haryanto, A.A., Islam, M.S., Taniar, D., Cheema, M.A.: Ig-tree: an efficient spatial keyword index for planning best path queries on road networks. In: World Wide Web. https://doi.org/10.1007/s11280-018-0643-5, vol. 22, pp 1359–1399 (2019)
Hayashi, T., Akiba, T., Kawarabayashi, K.: Fully dynamic shortest-path distance query acceleration on massive networks. In: CIKM. https://doi.org/10.1145/2983323.2983731, pp 1533–1542 (2016)
Jiang, M., Fu, A., Wong, R.C.W., Xu, Y.: Hop doubling label indexing for point-to-point distance querying on scale-free networks. In: PVLDB. https://doi.org/10.14778/2732977.2732993, vol. 7, pp 1203–1214 (2014)
Jiang, M., Fu, A.W., Wong, R.C., Xu, Y.: Hop doubling label indexing for point-to-point distance querying on scale-free networks. PVLDB 7(12), 1203–1214 (2014)
Li, W., Qiao, M., Qin, L., Zhang, Y., Chang, L., Lin, X.: Scaling distance labeling on small-world networks. In: SIGMOD. https://doi.org/10.1145/3299869.3319877, pp 060–1077 (2019)
Li, W., Qiao, M., Qin, L., Zhang, Y., Chang, L., Lin, X.: Scaling up distance labeling on graphs with core-periphery properties. In: Maier, D., Pottinger, R., Doan, A., Tan, W., Alawini, A., Ngo, H.Q. (eds.) Proceedings of the 2020 International Conference on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], June 14-19, 2020, pp 1367–1381. ACM (2020)
Li, Y., U, L.H., Yiu, M., Kou, N.: An experimental study on hub labeling based shortest path algorithms. In: PVLDB. https://doi.org/10.1145/3186728.3164141, vol. 11, pp 445–457 (2017)
Potamias, M., Bonchi, F., Castillo, C., Gionis, A.: Fast shortest path distance estimation in large networks. In: CIKM. https://doi.org/10.1145/1645953.1646063, pp 867–876 (2009)
Qiao, M., Cheng, H., Chang, L., Yu, J.X.: Approximate shortest distance computing: A query-dependent local landmark scheme. TKDE 26, 55–68 (2012). https://doi.org/10.1109/TKDE.2012.253
Ren, X., Sengupta, N., Ren, X., Wang, J., Curé, O.: Finding minimum connected subgraphs with ontology exploration on large RDF data. arXiv:2010.06336 (2020)
Ren, X., Wang, J.: Exploiting vertex relationships in speeding up subgraph isomorphism over large graphs. Proc. VLDB Endow. 8(5), 617–628 (2015)
Rossi, R.A., Ahmed, N.K.: The network data repository with interactive graph analytics and visualization. In: AAAI. https://networkrepository.com (2015)
Shi, Y., Cheng, G., Kharlamov, E.: Keyword search over knowledge graphs via static and dynamic hub labelings. In: WWW, pp 235–245 (2020)
Wang, J., Anirban, S., Amagasa, T., Shiokawa, H., Gong, Z., Islam, M.S.: A hybrid index for distance queries. In: WISE, pp 227–241 (2020)
Wang, X., Zhang, Y., Zhang, W., Lin, X.: Efficient distance-aware influence maximization in geo-social networks. TKDE 29, 599–612 (2017). https://doi.org/10.1109/TKDE.2016.2633472
Yahia, S.A., Benedikt, M., Lakshmanan, L.V., Stoyanovich, J.: Efficient network aware search in collaborative tagging sites. In: PVLDB. https://doi.org/10.14778/1453856.1453934, vol. 1, pp 710–721 (2008)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no financial or non-financial interests to disclose that are relevant to the content of this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Anirban, S., Wang, J., Islam, M.S. et al. Compression techniques for 2-hop labeling for shortest distance queries. World Wide Web 25, 151–174 (2022). https://doi.org/10.1007/s11280-021-00977-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-021-00977-1