Object Clustering in Linked Data Using Centrality

Zhang, Xiang; Lv, Yulian; Lin, Erjing

doi:10.1007/978-981-10-3168-7_17

Xiang Zhang¹⁶,
Yulian Lv¹⁷ &
Erjing Lin¹⁶

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 650))

Included in the following conference series:

China Conference on Knowledge Graph and Semantic Computing

1440 Accesses

Abstract

Large-scale linked data is becoming a challenge to many Semantic Web tasks. While clustering of graphs has been deeply researched in network science and machine learning, not many researches are carried on clustering in linked data. To identify meta-structures in large-scale linked data, the scalability of clustering should be considered. In this paper, we propose a scalable approach of centrality-based clustering, which works on a model of Object Graph derived from RDF graph. Centrality of objects is calculated as indicators for clustering. Both relational and linguistic closeness between objects are considered in clustering to produce coherent clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
SWCC: http://data.semanticweb.org/.
2.
JAME: http://dbtune.org/jamendo/.
3.
LMDB: http://linkedmdb.org/.
4.
Gephi: https://gephi.org/users/download/.

References

Hartig, O., Bizer, C., Freytag, J.-C.: Executing SPARQL queries over the web of linked data. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 293–309. Springer, Heidelberg (2009). doi:10.1007/978-3-642-04930-9_19
Chapter Google Scholar
Paulheim, H.: Exploiting linked open data as background knowledge in data mining. In: Proceedings of International Workshop on Data Mining on Linked Data, with Linked Data Mining Challenge Collocated with ECMLPKDD 2013, pp. 1–10 (2013)
Google Scholar
Qu, Y., Hu, W., Cheng, G.: Constructing virtual documents for ontology matching. In: Proceedings of 15th International Conference on World Wide Web (WWW 2006), pp. 23–31 (2006)
Google Scholar
Newman, M.E.J.: A measure of betweenness centrality based on random walks. Soc. Netw. 27, 39–54 (2005)
Article Google Scholar
Zhang, X., Cheng, G., Qu, Y.: Ontology summarization based on RDF sentence graph. In: Proceedings of 16th International Conference on World Wide Web – WWW 2007, p. 707 (2007)
Google Scholar
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical report, Stanford Digital Library Technologies Project (1998)
Google Scholar
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46, 668–677 (1999)
Article MathSciNet MATH Google Scholar
Lempel, R., Moran, S.: Stochastic approach for link-structure analysis (SALSA) and the TKC effect. Comput. Netw. 33, 387–401 (2000)
Article Google Scholar
Sheskin, D.J.: Handbook of parametric and nonparametric statistical procedures. Technometrics 46, 1193 (2004)
MATH Google Scholar
Schlicht, A., Stuckenschmidt, H.: Towards structural criteria for ontology modularization. In: CEUR Workshop Proceedings (2006)
Google Scholar
Grimnes, G.A.A, Edwards, P., Preece, A.: Instance based clustering of semantic web resources. In: Proceedings of 5th European Semantic Web Conference on the Semantic Web: Research and Applications, pp. 303–317 (2008)
Google Scholar
Yan, Y., Wang, C., Zhou, A., Qian, W., Ma, L., Pan, Y.: Efficient indices using graph partitioning in RDF triple stores. In: Proceedings - International Conference on Data Engineering, pp. 1263–1266 (2009)
Google Scholar
Aluç, G., Özsu, M.T., Daudjee, K.: Clustering RDF databases using tunable-LSH. pp. 1–13, CoRR, abs/1504.02523 (2015)
Google Scholar
Tabrizi, S.A., Shakery, A., Asadpour, M., Abbasi, M., Tavallaie, M.A.: Personalized PageRank clustering: a graph clustering algorithm based on random walks. Phys. A Stat. Mech. Appl. 392, 5772–5785 (2013)
Article MathSciNet Google Scholar

Download references

Acknowledgement

The work was supported by the National High-Tech Research and Development (863) Program of China (No. 2015AA015406) and the Open Project of Jiangsu Key Laboratory of Data Engineering and Knowledge Service (No. DEKS2014KT002).

Author information

Authors and Affiliations

School of Computer Science and Engineering, Southeast University, Nanjing, China
Xiang Zhang & Erjing Lin
College of Software Engineering (Suzhou), Southeast University, Suzhou, China
Yulian Lv

Authors

Xiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yulian Lv
View author publications
You can also search for this author in PubMed Google Scholar
Erjing Lin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiang Zhang .

Editor information

Editors and Affiliations

Zhejiang University, Zhejiang, China
Huajun Chen
Rensselaer Polytechnic Institute, Troy, New York, USA
Heng Ji
Chinese Academy of Sciences, Beijing, China
Le Sun
Google Research, Mountain View, California, USA
Haixun Wang
Wuhan University, Wuhan, Hubei, China
Tieyun Qian
East China University of Science and Technology, Shanghai, China
Tong Ruan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, X., Lv, Y., Lin, E. (2016). Object Clustering in Linked Data Using Centrality. In: Chen, H., Ji, H., Sun, L., Wang, H., Qian, T., Ruan, T. (eds) Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data. CCKS 2016. Communications in Computer and Information Science, vol 650. Springer, Singapore. https://doi.org/10.1007/978-981-10-3168-7_17

Download citation

DOI: https://doi.org/10.1007/978-981-10-3168-7_17
Published: 23 November 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3167-0
Online ISBN: 978-981-10-3168-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics