Abstract
The Web is evolving from a Web of Documents to a Web of Data. Meanwhile, the development of Semantic Web applications opens the way for addressing complex information needs. In this scenario, clustering is identified as a crucial task for semantic mashups. After a thorough review of RDF clustering techniques, the paper addresses the open issues within the efficient exploitation of the knowledge contained in RDF data sources. Then, first promising attempts in exploring the applicability of community detection algorithms for RDF clustering are reported.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Berners-Lee, T.: Linked data (2006), http://www.w3.org/designissues/linkeddata.html
Hausenblas, M., Halb, W., Raimond, Y., Heath, T.: What is the size of the semantic web? In: Proceedings of I-Semantics, pp. 9–16 (2008)
Bizer, C., Heath, T., Idehen, K., Berners-Lee, T.: Linked data on the web (ldow2008). In: Proceedings of the 17th International Conference on World Wide Web, pp. 1265–1266. ACM (2008)
Tran, T., Wang, H., Haase, P.: Hermes: Data web search on a pay-as-you-go integration infrastructure. Web Semantics: Science, Services and Agents on the World Wide Web 7(3), 189–203 (2009)
Zeng, K., Yang, J., Wang, H., Shao, B., Wang, Z.: A distributed graph engine for web scale rdf data. In: Proceedings of the 39th International Conference on Very Large Data Bases, pp. 265–276. VLDB Endowment (2013)
Kaushik, R., Shenoy, P., Bohannon, P., Gudes, E.: Exploiting local similarity for indexing paths in graph-structured data. In: Proceedings of the 18th International Conference on Data Engineering, pp. 129–140. IEEE (2002)
Wu, A.Y., Garland, M., Han, J.: Mining scale-free networks using geodesic clustering. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 719–724. ACM (2004)
Konrath, M., Gottron, T., Staab, S., Scherp, A.: Schemexefficient construction of a data catalogue by stream-based indexing of linked data. Web Semantics: Science, Services and Agents on the World Wide Web 16, 52–58 (2012)
Böhm, C., Lorey, J., Naumann, F.: Creating void descriptions for web-scale data. Web Semantics: Science, Services and Agents on the World Wide Web 9(3), 339–345 (2011)
Algergawy, A., Massmann, S., Rahm, E.: A clustering-based approach for large-scale ontology matching. In: Eder, J., Bielikova, M., Tjoa, A.M. (eds.) ADBIS 2011. LNCS, vol. 6909, pp. 415–428. Springer, Heidelberg (2011)
Fortunato, S.: Community detection in graphs. Physics Reports 486(3), 75–174 (2010)
W3C: http://www.w3.org/tr/r2rml/ (September 27, 2012)
W3C: http://www.w3.org/tr/rdfa-lite/ (June 07, 2012)
Augenstein, I., Padó, S., Rudolph, S.: LODifier: Generating linked data from unstructured text. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 210–224. Springer, Heidelberg (2012)
Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment 2008(10), P10008 (2008)
Tran, T., Cimiano, P., Rudolph, S., Studer, R.: Ontology-based interpretation of keywords for semantic search. In: Aberer, K., et al. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 523–536. Springer, Heidelberg (2007)
Evans, T., Lambiotte, R.: Line graphs, link partitions, and overlapping communities. Physical Review EÂ 80(1), 016105 (2009)
Ahn, Y.Y., Bagrow, J.P., Lehmann, S.: Link communities reveal multiscale complexity in networks. Nature 466(7307), 761–764 (2010)
Schmidt, M., Hornung, T., Lausen, G., Pinkel, C.: Sp2bench: a sparql performance benchmark. In: IEEE 25th International Conference on Data Engineering, ICDE 2009, pp. 222–233. IEEE (2009)
Ravasz, E., Somera, A.L., Mongru, D.A., Oltvai, Z.N., Barabási, A.L.: Hierarchical organization of modularity in metabolic networks. Science 297(5586), 1551–1555 (2002)
Fanizzi, N., d’Amato, C.: A hierarchical clustering method for semantic knowledge bases. In: Apolloni, B., Howlett, R.J., Jain, L. (eds.) KES 2007, Part III. LNCS (LNAI), vol. 4694, pp. 653–660. Springer, Heidelberg (2007)
Grimnes, G.A., Edwards, P., Preece, A.D.: Instance based clustering of semantic web resources. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 303–317. Springer, Heidelberg (2008)
Grimnes, G.A., Edwards, P., Preece, A.D.: Learning meta-descriptions of the FOAF network. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 152–165. Springer, Heidelberg (2004)
Lösch, U., Bloehdorn, S., Rettinger, A.: Graph kernels for RDF data. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 134–148. Springer, Heidelberg (2012)
Maedche, A., Zacharias, V.: Clustering ontology-based metadata in the semantic web. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, pp. 348–360. Springer, Heidelberg (2002)
Delteil, A., Faron-Zucker, C., Dieng, R.: Learning ontologies from rdf annotations. In: Workshop on Ontology Learning (2001)
Rattigan, M.J., Maier, M., Jensen, D.: Graph clustering with network structure indices. In: Proceedings of the 24th International Conference on Machine Learning, pp. 783–790. ACM (2007)
Alzogbi, A., Lausen, G.: Similar structures inside rdf-graphs (2013)
Zhou, Y., Cheng, H., Yu, J.X.: Graph clustering based on structural/attribute similarities. Proceedings of the VLDB Endowment 2(1), 718–729 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Giannini, S. (2013). RDF Data Clustering. In: Abramowicz, W. (eds) Business Information Systems Workshops. BIS 2013. Lecture Notes in Business Information Processing, vol 160. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41687-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-41687-3_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41686-6
Online ISBN: 978-3-642-41687-3
eBook Packages: Computer ScienceComputer Science (R0)