Abstract
Linked Open Data initiatives have made available a diversity of collections that domain experts have annotated with controlled vocabulary terms from ontologies. We identify annotation signatures of linked data that associate semantically similar concepts, where similarity is measured in terms of shared annotations and ontological relatedness. Formally, an annotation signature is a partition or clustering of the links that represent the relationships between shared annotations. A clustering algorithm named AnnSigClustering is proposed to generate annotation signatures. Evaluation results over drug and disease datasets demonstrate the effectiveness of using annotation signatures to identify patterns among entities in the same cluster of a signature.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Benik, J., Chang, C., Raschid, L., Vidal, M.-E., Palma, G., Thor, A.: Finding cross genome patterns in annotation graphs. In: Bodenreider, O., Rance, B. (eds.) DILS 2012. LNCS, vol. 7348, pp. 21–36. Springer, Heidelberg (2012)
Bhagwani, S., Satapathy, S., Karnick, H.: Semantic textual similarity using maximal weighted bipartite graph matching. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, pp. 579–585. Association for Computational Linguistics (2012)
Cohen, W.W., Ravikumar, P.D., Fienberg, S.E.: A comparison of string distance metrics for name-matching tasks. In: IIWeb, pp. 73–78 (2003)
Cook, D.J., Holder, L.B.: Mining graph data. Wiley-Blackwell (2007)
Jaro, M.A.: Probabilistic linkage of large public health data files. In: Statistics in Medicine, pp. 491–498 (1995)
Pekar, V., Staab, S.: Taxonomy learning - factoring the structure of a taxonomy into a semantic classification decision. In: COLING (2002)
Shi, C., Kong, X., Yu, P.S., Xie, S., Wu, B.: Relevance search in heterogeneous networks. In: EDBT, pp. 180–191 (2012)
Sun, Y., Han, J., Yan, X., Yu, P.S., Wu, T.: Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. PVLDB 4(11), 992–1003 (2011)
Sun, Y., Han, J., Zhao, P., Yin, Z., Cheng, H., Wu, T.: Rankclus: integrating clustering with ranking for heterogeneous information network analysis. In: EDBT, pp. 565–576 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Palma, G., Vidal, ME., Raschid, L., Thor, A. (2014). Exploiting Semantics from Ontologies and Shared Annotations to Partition Linked Data. In: Galhardas, H., Rahm, E. (eds) Data Integration in the Life Sciences. DILS 2014. Lecture Notes in Computer Science(), vol 8574. Springer, Cham. https://doi.org/10.1007/978-3-319-08590-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-08590-6_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08589-0
Online ISBN: 978-3-319-08590-6
eBook Packages: Computer ScienceComputer Science (R0)