Skip to main content

Graph Representations and Storage

  • Living reference work entry
  • First Online:
Encyclopedia of Big Data Technologies
  • 436 Accesses

Definitions

Graph representation concerns the layout of graph data in the linearly addressed storage of computers (main memory, SSD, hard disk, etc.). General objectives for graph representations are (1) to use little storage space (space-efficiency) while (2) allowing operations and queries on the graph to be executed in a short time (time-efficiency). Hence, every graph representation technique is subject to a specific space–time trade-off. It depends on the particular scenario (data, workload, available resources, etc.) whether the space–time trade-off of a certain representation technique is worthwhile.

Overview

Graph data comes with a large variation of graph data models, with RDF and the property graph model (PGM) being the most prominent ones. The specific graph representation depends on the data model. We introduce the most important concepts for graph representations as data–model-agnostic as possible. The discussion is structured in (1) primary representation of the graph...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  • Abadi DJ, Marcus A, Madden S, Hollenbach KJ (2007) Scalable semantic web data management using vertical partitioning. In: VLDB. ACM, pp 411–422

    Google Scholar 

  • Aberger CR, Tu S, Olukotun K, Ré C (2016) EmptyHeaded: a relational engine for graph processing. In: SIGMOD, pp 431–446. https://doi.org/10.1145/2882903.2915213

  • Apostolico A, Drovandi G (2009) Graph compression by BFS. Algorithms 2(3):1031–1044. https://doi.org/10.3390/a2031031

  • Bast H, Delling D, Goldberg AV, Müller-Hannemann M, Pajor T, Sanders P, Wagner D, Werneck RF (2016) Route planning in transportation networks. In: Algorithm engineering – selected results and surveys, vol 9220, pp 19–80. https://doi.org/10.1007/978-3-319-49487-6_2

  • Blandford DK, Blelloch GE, Kash IA (2004) An experimental analysis of a compact graph representation. In: ALENEX. SIAM, pp 49–61

    Google Scholar 

  • Boldi P, Vigna S (2004) The webGraph framework I: compression techniques. In: WWW. ACM, pp 595–602. https://doi.org/10.1145/988672.988752

  • Boldi P, Rosa M, Santini M, Vigna S (2011) Layered label propagation: a multiResolution coordinate-free ordering for compressing social networks. In: WWW. ACM, pp 587–596. https://doi.org/10.1145/1963405.1963488

  • Bornea MA, Dolby J, Kementsietsidis A, Srinivas K, Dantressangle P, Udrea O, Bhattacharjee B (2013) Building an efficient RDF store over a relational database. In: SIGMOD. ACM, pp 121–132. https://doi.org/10.1145/2463676.2463718

  • Brisaboa NR, Ladra S, Navarro G (2014) Compact representation of web graphs with extended functionality. Inf Syst 39:152–174. https://doi.org/10.1016/j.is.2013.08.003

  • Chen R, Shi J, Chen Y, Chen H (2015) PowerLyra: differentiated graph computation and partitioning on skewed graphs. In: EuroSys. ACM, pp 1:1–1:15. https://doi.org/10.1145/2741948.2741970

  • Chong EI, Das S, Eadon G, Srinivasan J (2005) An efficient SQL-based RDF querying scheme. In: VLDB. ACM, pp 1216–1227

    Google Scholar 

  • Chu E, Beckmann JL, Naughton JF (2007) The case for a wide-table approach to manage sparse relational data sets. In: SIGMOD. ACM, pp 821–832. https://doi.org/10.1145/1247480.1247571

  • Claude F, Navarro G (2010a) Extended compact web graph representations. In: Algorithms and applications, essays dedicated to Esko Ukkonen on the occasion of his 60th birthday. Springer, pp 77–91. https://doi.org/10.1007/978-3-642-12476-1_5

  • Claude F, Navarro G (2010b) Fast and compact web graph representations. ACM Trans Web 4(4):16:1–16:31. https://doi.org/10.1145/1841909.1841913

  • Cormen TH, Leiserson CE, Rivest R, Stein C (2001) Introduction to algorithms, 2nd edn. MIT Press, Cambridge

    Google Scholar 

  • Crockford D (2006) The application/json media type for JavaScript object notation (JSON), RFC 4627. http://tools.ietf.org/html/rfc4627

  • Erling O, Mikhailov I (2007) RDF support in the virtuoso DBMS. In: CSSW, GI, vol 113, pp 59–68

    Google Scholar 

  • Grabowski S, Bieniecki W (2011) Merging adjacency lists for efficient web graph compression. In: ICMMI. Springer, pp 385–392. https://doi.org/10.1007/978-3-642-23169-8_42

  • Gustavson FG (1978) Two fast algorithms for sparse matrices: multiplication and permuted transposition. Trans Math Softw 4(3):250–269. https://doi.org/10.1145/355791.355796

  • Hauck M, Paradies M, Fröning H, Lehner W, Rauhe H (2015) Highspeed graph processing exploiting main-memory column stores. In: Euro-Par, pp 503–514. https://doi.org/10.1007/978-3-319-27308-2_41

  • Herrmann K, Voigt H, Lehner W (2014a) Cinderella – adaptive online partitioning of irregularly structured data. In: SMDB. IEEE Computer Society, pp 284–291. https://doi.org/10.1109/ICDEW.2014.6818342

  • Herrmann K, Voigt H, Lehner W (2014b) Online horizontal partitioning of heterogeneous data. It – Inf Technol 56(1):4–12. https://doi.org/10.1515/itit-2014-1015

  • Khayyat Z, Awara K, Alonazi A, Jamjoom H, Williams D, Kalnis P (2013) Mizan: a system for dynamic load balancing in large-scale graph processing. In: EuroSys. ACM, pp 169–182. https://doi.org/10.1145/2465351.2465369

  • Krause A, Kissinger T, Habich D, Voigt H, Lehner W (2017) Partitioning strategy selection for in-memory graph pattern matching on multiprocessor systems. In: Euro-Par

    Google Scholar 

  • LeBeane M, Song S, Panda R, Ryoo JH, John LK (2015) Data partitioning strategies for graph workloads on heterogeneous clusters. In: SC. ACM, pp 56:1–56:12. https://doi.org/10.1145/2807591.2807632

  • MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: BSMSP. University of California Press, vol 1, pp 281–297

    Google Scholar 

  • Nenov Y, Piro R, Motik B, Horrocks I, Wu Z, Banerjee J (2015) RDFox: a highly-scalable RDF store. In: ISWC. Springer, vol 9367, pp 3–20. https://doi.org/10.1007/978-3-319-25010-6_1

  • Neumann T, Weikum G (2008) RDF-3X: a RISC-style engine for RDF. PVLDB 1(1):647–659

    Google Scholar 

  • Paradies M, Lehner W, Bornhövd C (2015) GRAPHITE: an extensible graph traversal framework for relational database management systems. In: SSDBM. ACM, pp 29:1–29:12. https://doi.org/10.1145/2791347.2791383

  • Pham M, Passing L, Erling O, Boncz PA (2015) Deriving an emergent relational schema from RDF data. In: WWW. ACM, pp 864–874. https://doi.org/10.1145/2736277.2741121

  • Rudolf M, Paradies M, Bornhövd C, Lehner W (2013) The graph story of the SAP HANA database. In: BTW, GI, vol 214, pp 403–420

    Google Scholar 

  • Shun J, Blelloch GE (2013) Ligra: a lightweight graph processing framework for shared memory. In: PPoPP, pp 135–146. https://doi.org/10.1145/2442516.2442530

  • Sommer C (2014) Shortest-path queries in static networks. ACM Comput Surv 46(4):45:1–45:31. https://doi.org/10.1145/2530531

  • Su J, Zhu Q, Wei H, Yu JX (2017) Reachability querying: can it be even faster? IEEE TKDE 29(3):683–697. https://doi.org/10.1109/TKDE.2016.2631160

  • Sun W, Fokoue A, Srinivas K, Kementsietsidis A, Hu G, Xie GT (2015) SQLGraph: an efficient relational-based property graph store. In: SIGMOD. ACM, pp 1887–1901. https://doi.org/10.1145/2723372.2723732

  • Verma S, Leslie LM, Shin Y, Gupta I (2017) An experimental comparison of partitioning strategies in distributed graph processing. PVLDB 10(5):493–504

    Google Scholar 

  • Weiss C, Karras P, Bernstein A (2008) Hexastore: sextuple indexing for semantic web data management. PVLDB 1(1):1008–1019

    Google Scholar 

  • Yu JX, Cheng J (2010) Graph reachability queries: a survey. In: Managing and mining graph data, vol 40. Springer, pp 181–215. https://doi.org/10.1007/978-1-4419-6045-0_6

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Marcus Paradies or Hannes Voigt .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Paradies, M., Voigt, H. (2018). Graph Representations and Storage. In: Sakr, S., Zomaya, A. (eds) Encyclopedia of Big Data Technologies. Springer, Cham. https://doi.org/10.1007/978-3-319-63962-8_211-1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63962-8_211-1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63962-8

  • Online ISBN: 978-3-319-63962-8

  • eBook Packages: Springer Reference MathematicsReference Module Computer Science and Engineering

Publish with us

Policies and ethics